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Preface 


n writing this book I have been motivated by the desire to create a 
high-quality textbook that costs almost nothing. 

The book is available on my web page for free, and the paperback 
versions (produced through an on-demand press) cost considerably less 
than comparable traditional textbooks. Any revised editions of this text 
will be issued solely for the purpose of correcting mistakes and clarifying 
exposition. New exercises may be added, but the existing ones will not be 
unnecessarily changed or renumbered. 

This text is an expansion and refinement of lecture notes I developed 
while teaching proofs courses over the past ten years. It is written for an 
audience of mathematics majors at Virginia Commonwealth University, 
a large state university. It caters to our program, and is intended to 
prepare our students for our more advanced courses. However, I am 
mindful of a larger audience. I believe this book is suitable for almost any 
undergraduate mathematics program. 


Richmond, Virginia RicHARD HAMMACK 
July 12, 2012 


Introduction 


haa is a book about how to prove theorems. 


Until this point in your education, you may have regarded mathemat- 
ics primarily as a computational discipline. You have learned to solve 
equations, compute derivatives and integrals, multiply matrices and find 
determinants; and you have seen how these things can answer practical 
questions about the real world. In this setting, your primary goal in using 
mathematics has been to compute answers. 

But there is another approach to mathematics that is more theoretical 
than computational. In this approach, the primary goal is to understand 
mathematical structures, to prove mathematical statements, and even 
to invent or discover new mathematical theorems and theories. The 
mathematical techniques and procedures that you have learned and used 
up until now have their origins in this theoretical side of mathematics. For 
example, in computing the area under a curve, you use the fundamental 
theorem of calculus. It is because this theorem is true that your answer 
is correct. However, in your calculus class you were probably far more 
concerned with how that theorem could be applied than in understanding 
why it is true. But how do we know it is true? How can we convince 
ourselves or others of its validity? Questions of this nature belong to the 
theoretical realm of mathematics. This book is an introduction to that 
realm. 

This book will initiate you into an esoteric world. You will learn and 
apply the methods of thought that mathematicians use to verify theorems, 
explore mathematical truth and create new mathematical theories. This 
will prepare you for advanced mathematics courses, for you will be better 
able to understand proofs, write your own proofs and think critically and 
inquisitively about mathematics. 
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The book is organized into four parts, as outlined below. 


PARTI Fundamentals 


e Chapter 1: Sets 

e Chapter 2: Logic 

e¢ Chapter 3: Counting 

Chapters 1 and 2 lay out the language and conventions used in all advanced 
mathematics. Sets are fundamental because every mathematical structure, 
object or entity can be described as a set. Logic is fundamental because it 
allows us to understand the meanings of statements, to deduce information 
about mathematical structures and to uncover further structures. All 
subsequent chapters will build on these first two chapters. Chapter 3 
is included partly because its topics are central to many branches of 
mathematics, but also because it is a source of many examples and exercises 
that occur throughout the book. (However, the course instructor may choose 
to omit Chapter 3.) 


PART II Proving Conditional Statements 


e Chapter 4: Direct Proof 

e Chapter 5: Contrapositive Proof 

e Chapter 6: Proof by Contradiction 

Chapters 4 through 6 are concerned with three main techniques used for 
proving theorems that have the “conditional” form “If P, then Q.” 


PART III More on Proof 


e Chapter 7: Proving Non-Conditional Statements 

e Chapter 8: Proofs Involving Sets 

e Chapter 9: Disproof 

e Chapter 10: Mathematical Induction 

These chapters deal with useful variations, embellishments and conse- 
quences of the proof techniques introduced in Chapters 4 through 6. 


PART IV Relations, Functions and Cardinality 


e Chapter 11: Relations 

e Chapter 12: Functions 

e Chapter 13: Cardinality of Sets 

These final chapters are mainly concerned with the idea of functions, which 
are central to all of mathematics. Upon mastering this material you will 
be ready for many advanced mathematics courses, such as combinatorics, 
abstract algebra, analysis and topology. 


x Introduction 


The book is designed to be covered in a fourteen-week semester. Here 
is a possible timetable. 


Week | Monday Wednesday Friday 

1 Section 1.1 Section 1.2 Sections 1.3, 1.4 

2 Sections 1.5, 1.6, 1.7 Section 1.8 Sections 1.9*, 2.1 

3 Section 2.2 Sections 2.3, 2.4 Sections 2.5, 2.6 

4 Section 2.7 Sections 2.8", 2.9 Sections 2.10, 2.11*, 2.12* 
5 Sections 3.1, 3.2 Section 3.3 Sections 3.4, 3.5* 

6 EXAM Sections 4.1, 4.2, 4.3 Sections 4.3, 4.4, 4.5* 
7 Sections 5.1, 5.2, 5.3* | Section 6.1 Sections 6.2 6.3* 

8 Sections 7.1, 7.2*, 7.3 | Sections 8.1, 8.2 Section 8.3 

9 Section 8.4 Sections 9.1, 9.2, 9.3* | Section 10.0 

10 Sections 10.0, 10.3* Sections 10.1, 10.2 EXAM 

11 Sections 11.0, 11.1 Sections 11.2, 11.3 Sections 11.4, 11.5 
12 Section 12.1 Section 12.2 Section 12.2 

13 Sections 12.3, 12.4* Section 12.5 Sections 12.5, 12.6* 
14 Section 13.1 Section 13.2 Sections 13.3, 13.4* 


Sections marked with « may require only the briefest mention in class, or 
may be best left for the students to digest on their own. Some instructors 
may prefer to omit Chapter 3. 


Acknowledgments. I thank my students in VCU’s MATH 300 courses 
for offering feedback as they read this book. Thanks especially to Cory 
Colbert and Lauren Pace for rooting out many typographical mistakes 
and inconsistencies. I am especially indebted to Cory for reading early 
drafts of each chapter and catching numerous mistakes before I posted the 
final draft on my web page. Cory also created the index, suggested some 
interesting exercises, and wrote some solutions. Thanks to Andy Lewis 
for suggesting many improvements while teaching from the book. I am 
grateful to Eleni Kanakis for creating the cover art. 

I am indebted to VCU’s Center For Teaching Excellence for awarding 
Lon Mitchell and me a CTE Small Grant for open-source textbook pub- 
lishing. Through this grant we bought the ISBN and professional fonts. 
Thanks also to series editor Lon Mitchell, whose expertise with typesetting 
and on-demand publishing made the print version of this book a reality. 

Finally, thanks to countless readers all over the world who contacted 
me concerning errors and omissions. Because of you, this is a better book. 


Part I 


Fundamentals 


CHAPTER 1 


Sets 


ll of mathematics can be described with sets. This becomes more and 

more apparent the deeper into mathematics you go. It will be apparent 

in most of your upper level courses, and certainly in this course. The 

theory of sets is a language that is perfectly suited to describing and 
explaining all types of mathematical structures. 


1.1 Introduction to Sets 


A set is a collection of things. The things in the collection are called 
elements of the set. We are mainly concerned with sets whose elements 
are mathematical entities, such as numbers, points, functions, etc. 

A set is often expressed by listing its elements between commas, en- 
closed by braces. For example, the collection {2,4,6,8} is a set which has 
four elements, the numbers 2,4,6 and 8. Some sets have infinitely many 
elements. For example, consider the collection of all integers, 


toh S899 1.019 8A... 


Here the dots indicate a pattern of numbers that continues forever in both 
the positive and negative directions. A set is called an infinite set if it 
has infinitely many elements; otherwise it is called a finite set. 

Two sets are equal if they contain exactly the same elements. Thus 
{2,4,6,8} = {4,2,8,6} because even though they are listed in a different 
order, the elements are identical; but {2,4,6,8} 4 {2,4,6,7}. Also 


{...-4,-3,-2,-1,0,1,2,3,4...} = {0,-1,1,-2,2,-3,3,-4,4,...}. 


We often let uppercase letters stand for sets. In discussing the set 
{2,4,6,8} we might declare A = {2,4,6,8} and then use A to stand for 
{2,4,6,8}. To express that 2 is an element of the set A, we write 2€ A, and 
read this as “2 is an element of A,” or “2 is in A,” or just “2 in A.” We also 
have 4€ A, 6€A and 8€A, but 5¢A. We read this last expression as “5 is 
not an element of A,” or “5 not in A.” Expressions like 6,2¢ A or 2,4,8€A 
are commonly used for indicating that several things are in a set. 


4 Sets 


Some sets are so significant and prevalent that we reserve special 
symbols for them. The set of natural numbers (i.e., the positive whole 
numbers) is denoted by N, that is, 


N = {1,2,3,4,5,6,7,...}. 
The set of integers 
Z= {...,-8,-2,-1,0,1,2,3,4,...} 


is another fundamental set. The symbol R stands for the set of all real 
numbers, a set that is undoubtedly familiar to you from calculus. Other 
special sets will be listed later in this section. 

Sets need not have just numbers as elements. The set B = {T,F} consists 
of two letters, perhaps representing the values “true” and “false.” The set 
C = {a,e,i,o,u} consists of the lowercase vowels in the English alphabet. 
The set D = {(0,0),(1,0),(0, 1),(1, J} has as elements the four corner points 
of a square on the x-y coordinate plane. Thus (0,0) €D, (1,0) €D, etc., but 
(1,2) ¢ D (for instance). It is even possible for a set to have other sets 
as elements. Consider E = {1, {2,3}, {2,4}}, which has three elements: the 
number 1, the set {2,3} and the set {2,4}. Thus le E and {2,3} ¢ E and 
{2,44e EH. But note that 2¢F, 3¢E and 4¢E. 

For yet another example, consider the set M = {[$ 3],[9],[42]} of three 
two-by-two matrices. We have [$8] «M, but [3°] ¢M. 


If X is a finite set, its cardinality or size is the number of elements 
it has, and this number is denoted as |X|. Thus for the sets above, |A| = 4, 
|B| = 2, |C|\=5, |D| =4, |E| =3 and |M|=3. 

There is a special set that, although small, plays a big role. The 
empty set is the set {} that has no elements. We denote it as @, so 9 = {}. 
Whenever you see the symbol @, it stands for {}. Observe that |¢|=0. The 
empty set is the only set whose cardinality is zero. 

Be very careful how you write the empty set. Don’t write {9} when 
you mean @. These sets can’t be equal because ¢ contains nothing while 
{¢} contains one thing, namely the empty set. If this is confusing, think 
of a set as a box with things in it, so, for example, {2,4,6,8} is a “box” 
containing four numbers. Thus the empty set @ = {} is an empty box. By 
contrast, {¢} is a box with an empty box inside it. Obviously, there’s a 
difference: An empty box is not the same as a box with an empty box 
inside it. Thus ¢ 4 {@}. (You might also observe that |¢|=0 and |{¢}|=1 
as additional evidence that @ # {¢}.) 
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This box analogy can help us think about sets. The set F = {¢, {a}, {{a}}} 
may look strange but it is really very simple. Think of it as a box containing 
three things: an empty box, a box containing an empty box, and a box 
containing a box containing an empty box. Thus |F|=3. The set G = {N, Z} 
is a box containing two boxes, the box of natural numbers and the box of 
integers. Thus |G| = 2. 

A special notation called set-builder notation is used to describe sets 
that are too big or complex to list between braces. Consider the infinite 
set of even integers E = {...,-6,—4,—2,0,2,4,6,...}. In set-builder notation 
this set is written as 

E = {2n:neZ}. 


We read the first brace as “the set of all things of form,” and the colon as 
“such that.” So the expression E = {2n:ne Z} is read as “E equals the set of 
all things of form 2n, such that n is an element of Z.” The idea is that E 
consists of all possible values of 2n, where n is takes on all values in Z. 
In general, a set X written with set-builder notation has the syntax 


X = {expression : rule}, 


where the elements of X are understood to be all values of “expression” 
that are specified by “rule.” For example, the set EF above is the set 
of all values the expression 2n that satisfy the rule ne Z. There can 
be many ways to express the same set. For example, E = {2n Ine z} = 
{n :n is an even integer} = {n:n =2k,k € Z$}. Another common way of 
writing it is 

E={néZ:n is even}, 
read as “E is the set of all n in Z such that n is even.” Some writers use 
a bar instead of a colon; for example, E = {ne€ Z|n is even}. We use the 
colon. 
Example 1.1 Here are some further illustrations of set-builder notation. 
. {n:n is a prime number} = {2,3,5,7,11,13,17,...} 
. {neN:n is prime} = {2,3,5,7,11,13,17,...} 
. {n?:n€ Z} = {0,1,4,9,16,25,...} 
. {xe R:x2-2=0} ={V2,-V2} 
. {xe Z:x2?-2=O0=¢G 
. {xe Z: |x| < 4} = {-3,-2,-1,0,1,2,3} 
. {2x +x € Z,|x| <4} = {-6,-4,-2,0,2,4,6} 
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8. {xe Z:|2x| <4} ={-1,0,1} 


These last three examples highlight a conflict of notation that we must 
always be alert to. The expression |X| means absolute value if X is a 
number and cardinality if X is a set. The distinction should always be 
clear from context. In the |x| in Example 6 above, we have x€ Z, so x is 
a number (not a set), and thus the bars in |x| must mean absolute value, 
not cardinality. On the other hand, consider A = {{1,2},{3,4,5,6},{7}} and 
B={X€A:|X|<3}. The elements of A are sets (not numbers), so the |X| 
in the expression for B must mean cardinality. Therefore B = {{1,2},{7}}. 

We close this section with a summary of special sets. These are sets or 
types of sets that come up so often that they are given special names and 
symbols. 


¢ The empty set: ¢ = {} 

¢ The natural numbers: N= {1,2,3,4,5,...} 

e The integers: Z={...,-3,-2,-1,0,1,2,3,4,5,...} 

¢ The rational numbers: Q= {x : x= where m,ne€Z and n¢ 0} 

¢ The real numbers: R_ (the set of all real numbers on the number line) 


Notice that Q is the set of all numbers that can be expressed as a fraction 

of two integers. You are surely aware that QR, as V2¢Q but V2ER. 
There are some other special sets that you will recall from your study 

of calculus. Given two numbers a,b€R with a <b, we can form various 

intervals on the number line. 

* Closed interval: [a,b]={xeR:a<x<b} 

¢ Half open interval: (a,b]={xeR:a<x<b} 

e Half open interval: [a,b)={xeR:asx<b} 

¢ Open interval: (a,b)={xeR:a<x<b} 

¢ Infinite interval: (a,co)= {xe R:a<x} 

¢ Infinite interval: [a,co)= {xe R:a<x} 

¢ Infinite interval: (—oo,b) = {xe R:x <b} 

¢ Infinite interval: (—oo,b] = {xe R:x <b} 

Remember that these are intervals on the number line, so they have in- 

finitely many elements. The set (0.1,0.2) contains infinitely many numbers, 

even though the end points may be close together. It is an unfortunate 

notational accident that (a,b) can denote both an interval on the line and 

a point on the plane. The difference is usually clear from context. In the 

next section we will see still another meaning of (a,b). 
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Exercises for Section 1.1 
A. Write each of the following sets by listing their elements between braces. 


1. {5x-1:x€Z} 9. {xe R:sinax = 0} 
2. {8x+2:x€ Z} 10. {x€R:cosx = 1} 
3. {xe Z:-2<x<7} 11. {xe Z: |x| <5} 

4. {xeN:-2<x<7} 12. {xe Z:|2x| <5} 
5. {xe R: x? =3} 13. {xe Z:|6x| <5} 
6. {xER:x?=9} 14, {5x:x€ Z,|2x| < 8} 
7. {xe€R:x2+5x =-6} 15. {5a+2b:a,beZ} 
8. {xe R:x3 + 5x? = —6x} 16. {6a+2b:a,beZ} 


B. Write each of the following sets in set-builder notation. 


17. {2,4,8,16,32,64...} 23. {3,4,5,6,7,8} 

18. {0,4,16, 36,64, 100,...} 4, {-4,-3, i —1,0,1,2} 

19. {...,-6,-3,0,3,6,9, 12, 15,...} me a , a 49> 1,2,4,8,...} 

20. {...,-8,-3,2,7,12,17,...} 26. {... ab basoan. z 

21. {0,1,4,9, 16,25, 86,.. J 27. {...,-1,-4,0, 3,2, 32, 2m, 82,...} 
22. {3,6,11,18,27,38,...} 28. {... nae 3 0,3,3,3,3,3,2,...} 


C. Find the following cardinalities. 


29. |{{1}, {2, {3,4}}, }| 34. |{x EN: |x| < 10} 
30. |{{1,4},4,0,{{3, 4h}, 1o}}| 35. |{xeZ:x7 < 10}| 
B1. |{{{1}, {2, {3,4}, ot} | 36. [{xeN: x? < 10} 
32. |{{{1,4}.0,0, {13,4}, 1} th 37. |{xeN:x? < 0} 
33. |{x€ Z: |x| < 10}| 38. |{xeN:5x < 20} 


D. Sketch the following sets of points in the x-y plane. 
39. {(x,y):x€[1,2],y €[1,2]} 46. {(x,y):x,yER,x2+y? <1} 

40. {(x,y):x€[0,1],y €[1,2]} AT. {(x,y):x,yeER,y =x7-1} 

Al. {(x,y):x€[-1,1],y=1} 48. {(x,y):x,y€R,x>1} 
{(x,y):x =2,y €[0, 1]} 49. {(x,x+y):xER, ye Z} 

43. {(x,y): |x| =2,y€ [0, 11} 50. {(x,):2€ Ry €N} 
{(x,x*):x€ R} Bl. {(x,y) eR? : (y—x)(y+x)=0} 
{(x,y):x,y €R,x? + y? = 1} 52. {(x,y)eR? : (y—x2)(y +x?) = 0} 
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1.2 The Cartesian Product 


Given two sets A and B, it is possible to “multiply” them to produce a new 
set denoted as A x B. This operation is called the Cartesian product. To 
understand it, we must first understand the idea of an ordered pair. 


Definition 1.1 An ordered pair is a list (x,y) of two things x and y, 
enclosed in parentheses and separated by a comma. 


For example, (2,4) is an ordered pair, as is (4,2). These ordered pairs 
are different because even though they have the same things in them, 
the order is different. We write (2,4) 4 (4,2). Right away you can see that 
ordered pairs can be used to describe points on the plane, as was done in 
calculus, but they are not limited to just that. The things in an ordered 
pair don’t have to be numbers. You can have ordered pairs of letters, such 
as (m,¢), ordered pairs of sets such as ({2,2},{3,2}), even ordered pairs 
of ordered pairs like ((2,4),(4,2)). The following are also ordered pairs: 
(2, {1,2,3}), (R,(0,0)). Any list of two things enclosed by parentheses is an 
ordered pair. Now we are ready to define the Cartesian product. 


Definition 1.2. The Cartesian product of two sets A and B is another 
set, denoted as Ax B and defined as Ax B= {(a,b):a€ A,beB}. 


Thus A xB is a set of ordered pairs of elements from A and B. For 
example, if A = {k,¢,m} and B= {q,r}, then 


Ax B= {(k,q), (kr), (2,q),(€,7),(m,q),(m,r)}. 


Figure 1.1 shows how to make a schematic diagram of A x B. Line up the 
elements of A horizontally and line up the elements of B vertically, as if A 
and B form an x- and y-axis. Then fill in the ordered pairs so that each 
element (x,y) is in the column headed by x and the row headed by y. 


AxB 
(kr) (é,r) (m,r) 


(k,q) (,q) (m,q) 


Figure 1.1. A diagram of a Cartesian product 
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For another example, {0,1} x {2,1} = {(0,2),(0,1),(1,2),(1,1)}. If you are 
a visual thinker, you may wish to draw a diagram similar to Figure 1.1. 
The rectangular array of such diagrams give us the following general fact. 


Fact 1.1 If A and B are finite sets, then |A x B| =|A|-|BI. 


The set Rx R= {(x,y):x,y eR} should be very familiar. It can be viewed 
as the set of points on the Cartesian plane, and is drawn in Figure 1.2(a). 
The set Rx N= {(x,y):x€R,yeN} can be regarded as all of the points on 
the Cartesian plane whose second coordinate is a natural number. This 
is illustrated in Figure 1.2(b), which shows that R x N looks like infinitely 
many horizontal lines at integer heights above the x axis. The set Nx N 
can be visualized as the set of all points on the Cartesian plane whose 
coordinates are both natural numbers. It looks like a grid of dots in the 
first quadrant, as illustrated in Figure 1.2(c). 


y y 


(a) (b) (c) 
Figure 1.2. Drawings of some Cartesian products 


It is even possible for one factor of a Cartesian product to be a Cartesian 
product itself, as in Rx (Nx Z) = {(x,(y,z)):x€R,(y,z)ENx Z}. 

We can also define Cartesian products of three or more sets by moving 
beyond ordered pairs. An ordered triple is a list (x,y,z). The Cartesian 
product of the three sets R, N and Z is Rx Nx Z= {(x,y,z):x€R, yEN, ze Z}. 
Of course there is no reason to stop with ordered triples. In general, 


Ay x Agx-++x An = {(x1,%2,...,%n) 1x; € A; for each i =1,2,...,n}. 


Be mindful of parentheses. There is a slight difference between Rx(NxZ) 
and RxN~x Z. The first is a Cartesian product of two sets; its elements 
are ordered pairs (x,(y,z)). The second is a Cartesian product of three 
sets; its elements look like (x,y,z). To be sure, there will be situations in 
which there is no harm in blurring the distinction between expressions 
like (x,(y,z)) and (x,y,z), but for now we consider them as different. 
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We can also take Cartesian powers of sets. For any set A and positive 
integer n, the power A” is the Cartesian product of A with itself n times: 


AT =AxAx-+xXA={(x1,%9,...,%n) 5 1,%2,...,%n © A}. 


In this way, R? is the familiar Cartesian plane and R? is three-dimensional 
space. You can visualize how, if R? is the plane, then Z? = {(m,n):m,ne Z} 
is a grid of points on the plane. Likewise, as R® is 3-dimensional space, 
Z? = {(m,n,p):m,n,p € Z} is a grid of points in space. 

In other courses you may encounter sets that are very similar to R”, 
but yet have slightly different shades of meaning. Consider, for example, 
the set of all two-by-three matrices with entries from R: 


M=}[% 52]: u,v,w,x,y¥,zER}. 
This is not really all that different from the set 
R® = {(u,v,w,x,y,Z) : u,v,W,x,y,z € R}. 


The elements of these sets are merely certain arrangements of six real 
numbers. Despite their similarity, we maintain that M 4 R°, for two-by- 
three matrices are not the same things as sequences of six numbers. 


Exercises for Section 1.2 


A. Write out the indicated sets by listing their elements between braces. 
1. Suppose A = {1,2,3,4} and B= {a,c}. 


(a) AxB (c) AxA (e) @xB (g) Ax(BxB) 
(b) Bx A (d) BxB (f) (AxB)xB (h) B? 

2. Suppose A = {z,e,0} and B= {0,1}. 
(a) AxB (c) AxA (e) Ax@ (g) Ax(BxB) 
(b) Bx A (d) BxB (f) (Ax B)xB (h) AxBxB 

3. {xe R:x? = 2} x {a,c,e} 6. {xER:x? =x} x {xeN:x? =x} 

4. {neZ:2<n<5}x {neZ:|n|=5} 7. {o} x {0, a} x {0, 1} 

5. {xe Rix? =2) x {x eR: |x| = 2} 8. {0,1}4 

B. Sketch these Cartesian products on the x-y plane R? (or R® for the last two). 

9. {1,2,3} x {-1,0,1} 15. {1} x [0,1] 

10. {-1,0,1} x {1,2,3} 16. [0,1] x {1} 

11. [0,1] x [0,1] 17. NxZ 

12. [-1,1]x [1,2] 18. ZxZ 

13. {1,1.5,2} x [1,2] 19. [0,1] x [0,1] [0,1] 


14. [1,2] x {1,1.5,2} 20. {(x,y)eR?:x? + y? <1} x [0,1] 
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1.3 Subsets 


It can happen that every element of some set A is also an element of 
another set B. For example, each element of A = {0,2,4} is also an element 
of B = {0,1,2,3,4}. When A and B are related this way we say that A is a 
subset of B. 


Definition 1.3. Suppose A and B are sets. If every element of A is also 
an element of B, then we say A is a subset of B, and we denote this as 
ACB. We write AZB if A is not a subset of B, that is, if it is not true 
that every element of A is also an element of B. Thus A ¢B means that 
there is at least one element of A that is not an element of B. 


Example 1.2 Be sure you understand why each of the following is true. 
{23,7} < {2,3,4,5,6,7} 

. {2,3,7} Z {2,4,5,6, 7} 

. {2,3,7} < {23,7 

. {Qn:neZ}oZ 

. {(x,sin(x)) : xe R} CR? 

: {2,3,5,7,11,18,17,...} CN 

NEZCOCR 

RxNCRxR 


OANA TAR WN 


This brings us to a significant fact: If B is any set whatsoever, then 
@ CB. To see why this is true, look at the last sentence of Definition 1.3. 
It says that ¢ ¢B would mean that there is at least one element of @ 
that is not an element of B. But this cannot be so because @ contains no 
elements! Thus it is not the case that @ ZB, so it must be that @ CB. 


Fact 1.2 The empty set is a subset of every set, that is, @ ¢ B for any set B. 


Here is another way to look at it. Imagine a subset of B as a thing you 
make by starting with braces {}, then filling them with selections from B. 
For example, suppose B = {a,b,c}. To make one particular subset of B, 
start with {}, select b and c from B and insert them into {} to form the 
subset {b,c}. Alternatively, you could have chosen a and b to make {a,b}, 
and so on. But one option is to simply select nothing from B. This leaves 
you with the subset {}. Thus {}<B. More often we write it as CB. 
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This idea of “making” a subset can help us list out all the subsets of 
a given set B. As an example, let B= {a,b,c}. Let’s list all of its subsets. 
One way of approaching this is to make a tree-like structure. Begin with 
the subset {}, which is shown on the left of Figure 1.3. Considering the 
element a of B, we have a choice: insert it or not. The lines from {} point 
to what we get depending whether or not we insert a, either {} or {a}. Now 
move on to the element b of B. For each of the sets just formed we can 
either insert or not insert b, and the lines on the diagram point to the 
resulting sets {}, {b},{a}, or {a,b}. Finally, to each of these sets, we can 
either insert c or not insert it, and this gives us, on the far right-hand 
column, the sets {}, {c}, {b}, {b,c}, {a}, {a,c}, {a,b} and {a,b,c}. These are 
the eight subsets of B = {a,b,c}. 


Insert a? Insert b? Insert c? 


atom 
me — fo} 
wes b 
ae ant a 
eo 5 
7 | _— No iy 
Yes pe eens 
a ma 
—~ Yes _— {a,b} 
Cr ae 


Figure 1.3. A “tree” for listing subsets 


We can see from the way this tree branches out that if it happened that 
B= {a}, then B would have just two subsets, those in the second column 
of the diagram. If it happened that B = {a,b}, then B would have four 
subsets, those listed in the third column, and so on. At each branching of 
the tree, the number of subsets doubles. Thus in general, if |B| =n, then 
B must have 2” subsets. 


Fact 1.3 Ifa finite set has n elements, then it has 2” subsets. 
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For a slightly more complex example, consider listing the subsets of 
B = {1,2,{1,3}}. This B has just three elements: 1, 2 and {1,3}. At this 
point you probably don’t even have to draw a tree to list out B’s subsets. 
You just make all the possible selections from B and put them between 
braces to get 


Ue ip 12h HLS thf tLiL3y}, 124133}, 112113}. 


These are the eight subsets of B. Exercises like this help you identify what 
is and isn’t a subset. You know immediately that a set such as {1,3} is not 
a subset of B because it can’t be made by selecting elements from B, as 
the 3 is not an element of B and thus is not a valid selection. 


Example 1.3 Be sure you understand why the following statements are 
true. Each illustrates an aspect of set theory that you’ve learned so far. 


1s ETAL) bic 06s keine aor eee ee sane wants 1 is the first element listed in {1,{1}} 
Be NEN ATs Sick 5 ree nea seseeinarh aacncacns Sha enna cet Pace pahpcnaten ty wa because 1 is not a set 
BAe Pig Ie cost ainag nes exn gly scnne Pees {1} is the second element listed in {1, {1}} 
Ae Ae Wea Vt as eee aed od chic Satna make subset {1} by selecting 1 from {1, {1}} 
Be {eth {a caeewt vas because {1,{1}} contains only 1 and {1}, and not {{1}} 
Gy LS A ie ees eats make subset {{1}} by selecting {1} from {1, {1}} 
Po INE Newevetets because N is a set (not a number) and N contains only numbers 
SB AINE NE suid a heres eh Bete a od Bh oD Ae as because X < X for every set X 
9. GEN... eee eee eee because the set N contains only numbers and no sets 
TOs DS Nini cdots occ B see Reveal ire edtensth dae ae, ain 8 because ¢ is a subset of every set 
Pct BIN tak car ratettiet spect mea esey because {N} has just one element, the set N 
De IN sca Panece concn Castine aeseah gs otar wine! ahie because, for instance, 1e N but 1¢ {N} 
D7 (DEIN be pececne up areca we note that the only element of {N} is N, and N4@ 
TAY IOAN Fe ori a Ny e rae chee cna an ea ueaeGay because ¢ is a subset of every set 
15 De tO cecal ea ees @ is the first element listed in {o,N} 
DOD SAO res es et ati ce ead es eet tien because @ is a subset of every set 
Pie NETO et ete he teu Bitat 2h Ne make subset {N} by selecting N from {¢,N} 
TS PAN EEA ANS 10 coke islet alate eaeddauW ammonia dae tase because N¢ {@, {N}} 
192 INF EAD ANY oe ctu nme taeetunetcs cat {N} is the second element listed in {@, {N}} 
20. {(1,2),(2,2),(7, DPONXN. oc ceceeeeeee eee each of (1,2), (2,2), (7,1) is in NxN 


Though they should help you understand the concept of subset, the 
above examples are somewhat artificial. But in general, subsets arise very 
naturally. For instance, consider the unit circle C = {(x,y)€R?:x?+y? = 1}. 
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This is a subset CCR”. Likewise the graph of a function y= f(x) is a set 
of points G = {(x, f(x)):xeR}, and GCR?. You will surely agree that sets 
such as C and G are more easily understood or visualized when regarded 
as subsets of R?. Mathematics is filled with such instances where it is 
important to regard one set as a subset of another. 


Exercises for Section 1.3 
A. List all the subsets of the following sets. 


1. {1,2,3,4} 5. {o} 
2. {1,2,o} 6. {R,Q,N} 
3. {{R}} 7. {R{Q.N}} 
4. % 8. {{0,1},{0,1, {2}}, {o}} 
B. Write out the following sets by listing their elements between braces. 
9. {X:X¢{3,2,a} and |X| =2} 11. {X:X¢ {3,2,a} and |X| =4} 
10. {X CN: |X| <1} 12. {x:X¢{3,2,a} and |X|=1} 
C. Decide if the following statements are true or false. Explain. 
13. R° CR? 15. {(x,y):x-1=0}¢ {(x, y):x2-x =0} 
14. R?cR? 16. {(x,y):x?-x=0}¢ {(x,y):x-1=0} 


1.4 Power Sets 


Given a set, you can form a new set with the power set operation, defined 
as follows. 


Definition 1.4 If A is a set, the power set of A is another set, denoted 
as Y(A) and defined to be the set of all subsets of A. In symbols, Y(A) = 
{Xx 1X cA}. 


For example, suppose A = {1,2,3}. The power set of A is the set of all 
subsets of A. We learned how to find these in the previous section, and 
they are {}, {1}, {2}, {3}, {1,2}, {1,3}, {2,3} and {1,2,3}. Therefore the power 
set of A is 


P(A) ={ o, {1}, {2}, {3}, {1,2}, {1,3}, {2,3}, {12,3} }. 


As we saw in the previous section, if a finite set A has n elements, then 
it has 2” subsets, and thus its power set has 2” elements. 


Power Sets 15 


Fact 1.4 If A is a finite set, then | A(A)| =2'4!. 


Example 1.4 You should examine the following statements and make 
sure you understand how the answers were obtained. In particular, notice 
that in each instance the equation |A(A)| =2'4! is true. 


- P({0,1,3}) = 1, {0}, {1}, {3}, {0,1}, {0,3}, {13}, {0,1,3} } 
P({L2})={9, tp {2}, {L2}} 
- A({t) tl 


{(G,H), (,{H}), (la}.9), (taf. {o}) } 
©, {D}, fox {D1 Dh} } 
J={0 {1p {{1,2}}, {1.41.2} 
10. A>({Z,N}) ={, {Z}, {N}, {ZN} } 
Next are some that are wrong. See if you can determine why they are wrong 
and make sure you understand the explanation on the right. 


CHNA TAR OD 
=a 
a 
“~~ 
— 


iT 22 Gb eee re ek eee re meaningless because 1 is not a set 
12: PAT AL OE) SAO ATL Ot ON ct ee wrong because {1,2} ¢ {1, {1,23} 
13. PA({1,{1,2}}) = {o, {{1P {{1, 2b {a {Lathe ... wrong because {{1}} ¢ {1, {1,2}} 


If A is finite, it is possible (though maybe not practical) to list out A(A) 
between braces as was done in Examples 1-10 above. That is not possible 
if A is infinite. For example, consider A(N). You can start writing out the 
answer, but you quickly realize N has infinitely many subsets, and it’s not 
clear how (or if) they could be arranged as a list with a definite pattern: 


AN) = {6, {1}, {2},..., {1,2}, {1,3},..., {39,47}, 
ASST 1B AO 4G Boch? nat 


The set A(R?) is mind boggling. Think of R? = {(x, y):x,y€R} as the set 
of all points on the Cartesian plane. A subset of R? (that is, an element 
of Y(R7)) is a set of points in the plane. Let’s look at some of these sets. 
Since {(0,0),(1,1)} CR’, we know that {(0,0),(1,D} « A(R?). We can even 
draw a picture of this subset, as in Figure 1.4(a). For another example, 
the graph of the equation y= x" is the set of points G = {(x,x”):x eR} and 
this is a subset of R?, so Ge A(R). Figure 1.4(b) is a picture of G. Since 
this can be done for any function, the graph of every imaginable function 
f :R—R can be found inside of A(R?). 
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y y J 
. INFINITE 
x x x 
(a) (b) (c) 


Figure 1.4. Three of the many, many sets in A(R?) 


In fact, any black-and-white image on the plane can be thought of as a 
subset of R?, where the black points belong to the subset and the white 
points do not. So the text “INFINITE” in Figure 1.4(c) is a subset of R? 
and therefore an element of “A(R?). By that token, A(R?) contains a copy 
of the page you are reading now. 

Thus in addition to containing every imaginable function and every 
imaginable black-and-white image, “(R*) also contains the full text of 
every book that was ever written, those that are yet to be written and 
those that will never be written. Inside of A(R?) is a detailed biography of 
your life, from beginning to end, as well as the biographies of all of your 
unborn descendants. It is startling that the five symbols used to write 
(R?) can express such an incomprehensibly large set. 

Homework: Think about A(A(R?)). 


Exercises for Section 1.4 
A. Find the indicated sets. 


1. P({{a,b}, {e}}) 7. P({a,b}) x P({0,1}) 

2. P({1,2,3,4}) 8. P({1,2} x {3}) 

3. P({{O},5}) 9. P ({a,5} x {0}) 

4. PA ({R,Q}) 10. {Xe A({1,2,3}) |x| <1} 

5. P(A ({2})) 11. {X ¢ A({1,2,3}) :|X| <1} 

6. P({1,2}) x A({3}) 12. {X « P({1,2,3}):2€ xX} 
B. Suppose that |A|=m and |B|=n. Find the following cardinalities. 

13. |A(APAA)))| 17. |[{X « P(A): |X| <1} 

14. | A(A(A))| 18. | A(A x A(B))| 

15. | PA(A x B)| 19. |A(A(AA x 9)))| 


16. | A(A)x AB) 20. |{X ¢ P(A) :|X| <1}| 
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1.5 Union, Intersection, Difference 


Just as numbers are combined with operations such as addition, subtrac- 
tion and multiplication, there are various operations that can be applied to 
sets. The Cartesian product (defined in Section 1.2) is one such operation; 
given sets A and B, we can combine them with x to get a new set Ax B. 
Here are three new operations called union, intersection and difference. 


Definition 1.5 Suppose A and B are sets. 
The union of A and B is the set AUB={x:xeA or xeB}. 
The intersection of A and B is the set ANB={x:xeA and xe B}. 
The difference of A and Bisthe set A-B={x:xeA and x¢B}. 


In words, the union AUB is the set of all things that are in A or in B 
(or in both). The intersection ANB is the set of all things in both A and B. 
The difference A-B is the set of all things that are in A but not in B. 


Example 1.5 Suppose A = {a,b,c,d,e}, B= {d,e,f} and C = {1,2,3}. 


. AUB= {a,b,c,d,e,f} 
_ ANB={d,e} 

. A-B= {a,b,c} 
Bot 


. (A-B)U(B-A)= {a,b,c, f} 

. AUC= {a,b,c,d,e,1,2,3} 

ANC=@ 

. A-C= {a,b,c,d,e} 

. (ANC)U(A-C)= {a,b,c,d,e} 

10. (ANB) B= {(d,d),(d,e),(d, f),(e,d),(e,e),(e, f)} 

11. (AxC)n(Bx C)= {(d, 1), (d, 2), (d, 3), (e, 1), (e, 2), (e,3)} 


CoONarmbhwn 


Observe that for any sets X and Y it is always true that X UY =YUX 
and XnNY =YnX, but in general X-Y 4Y-X. 

Continuing the example, parts 12-15 below use the interval notation 
discussed in Section 1.1, so [2,5] = {xe R:2<x<5}, etc. Sketching these 
examples on the number line may help you understand them. 

12. [2,5] U[8,6] = [2,6] 
13. [2,5] [3,6] =[3,5] 
14. [2,5]—[3,6] =[2,3) 
15. [0,3]—[1,2] =[0,1)U(2,3] 
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t tAUB 


Sets 


(a) (b) 


(d) 


Figure 1.5. The union, intersection and difference of sets A and B 


Example 1.6 Let A = {(x,x?):x eR} be the graph of the equation y = x 
and let B= {(x,x+2):x €R} be the graph of the equation y = x+2. These sets 
are subsets of R?. They are sketched together in Figure 1.5(a). Figure 1.5(b) 
shows AUB, the set of all points (x,y) that are on one (or both) of the two 
graphs. Observe that AnB = {(-1,1),(2,4)} consists of just two elements, 
the two points where the graphs intersect, as illustrated in Figure 1.5(c). 
Figure 1.5(d) shows A—B, which is the set A with “holes” where B crossed it. 
In set builder notation, we could write AUB = {(x,y):xe€R,y =x? or y=x+2} 


and A-B= {(x,x”):xeR-{-1,2}}. 


Exercises for Section 1.5 


1. Suppose A = {4,3,6,7,1,9}, B = {5,6,8,4} and C = {5,8,4}. Find: 


(a) AUB (d) A-C (g) BNC 
(b) ANB (e) B-A (h) BUC 
(c) A-B (f) ANC (i) C-B 
2. Suppose A = {0,2,4,6,8}, B = {1,3,5,7} and C = {2,8,4}. Find: 
(a) AUB (d) A-C (g) BNC 
(b) ANB (e) B-A (h) C-A 
(c) A-B (f) ANC (i) C-B 
3. Suppose A = {0,1} and B= {1,2}. Find: 
(a) (Ax B)n(B xB) (d) (AnB)xA (g) P(A)- AB) 
(b) (A x B)U(B x B) (e) (AxB)nB (h) AAnB) 
(c) (Ax B)—(BxB) (f) AA)n ACB) (i) A(AxB) 
4. Suppose A = {b,c,d} and B= {a,b}. Find: 
(a) (Ax B)n(BxB) (d) (AnB)xA (g) A(A)- AB) 
(b) (A x B)U(B x B) (e) (AxB)nB (h) AAnB) 


(c) (Ax B)-(B xB) (f) A(A)n AB) 


(i) P(A)x AB) 
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5. Sketch the sets X =[1,3]x[1,3] and Y =[2,4]x[2,4] on the plane R?. On separate 
drawings, shade in the sets X UY, XnY, X-Y and Y —X. (Hint: X and Y are 
Cartesian products of intervals. You may wish to review how you drew sets 
like [1,3] x [1,3] in the exercises for Section 1.2.) 

6. Sketch the sets X = [—1,3]x [0,2] and Y = [0,3] [1,4] on the plane R?. On 
separate drawings, shade in the sets X UY, XNY, X-Y and Y-X. 

7. Sketch the sets X = {(x, y)€ R?:x7+y? <1} and Y = {(x,y)e R?:x=0} on R?. On 
separate drawings, shade in the sets X UY, XNY, X-Y and Y-X. 

8. Sketch the sets X = {(x,y)€ R?:x7+y? <1} and Y = {(x,y)eR?:-1<y<0} on R?. 
On separate drawings, shade in the sets X UY, XNY, X-Y and Y-X. 

9. Is the statement (Rx Z)n(Z xR) = Zx Z true or false? What about the statement 
(Rx Z)U(Z x R)=RxR? 

10. Do you think the statement (R-Z) x N=(RxN)-—(ZxN) is true, or false? Justify. 


1.6 Complement 


This section introduces yet another set operation, called the set complement. 
The definition requires the idea of a universal set, which we now discuss. 

When dealing with a set, we almost always regard it as a subset 
of some larger set. For example, consider the set of prime numbers 
P = {2,3,5,7,11,13,...}. If asked to name some things that are not in P, we 
might mention some composite numbers like 4 or 6 or 423. It probably 
would not occur to us to say that Vladimir Putin is not in P. True, Vladimir 
Putin is not in P, but he lies entirely outside of the discussion of what is 
a prime number and what is not. We have an unstated assumption that 


PCN 


because N is the most natural setting in which to discuss prime numbers. 
In this context, anything not in P should still be in N. This larger set N is 
called the universal set or universe for P. 

Almost every useful set in mathematics can be regarded as having 
some natural universal set. For instance, the unit circle is the set C = 
{(x,y) € R? : x2 + y? = 1}, and since all these points are in the plane R? it is 
natural to regard R? as the universal set for C. In the absence of specifics, 
if A is a set, then its universal set is often denoted as U. We are now 
ready to define the complement operation. 


Definition 1.6 Let A be a set with a universal set U. The complement 
of A, denoted A, is the set A=U-A. 
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Example 1.7 If P is the set of prime numbers, then 


P=N-P = ({1,4,6,8,9,10,12,...}. 


Thus P is the set of composite numbers and 1. 


Example 1.8 Let A = {(x,x?):x€R} be the graph of the equation y= x”. 
Figure 1.6(a) shows A in its universal set R?. The complement of A is A = 
R?-A = {(x,y)eR*: y 4x7}, illustrated by the shaded area in Figure 1.6(b). 


A 
(a) (b) 


Figure 1.6. A set and its complement 


Exercises for Section 1.6 


1. 


Let A = {4,3,6,7,1,9} and B= {5,6,8,4} have universal set U = {ne Z:0<n<10}. 


(a) A (d) AUA (g) A-B 
(b) B (e) A-A (h) AnB 
(c) ANA (f) A-B (i) ANB 
. Let A= {0,2,4,6,8} and B = {1,3,5,7} have universal set U= {ne Z:0<n<8}. 
(a) A (d) AUA (g) ANB 
(b) B (e) A-A (h) AnB 
(c) ANA (f) AUB (i) AxB 


. Sketch the set X =[1,3]x [1,2] on the plane R?. On separate drawings, shade in 


the sets X and Xn ((0,2] x [0,3]). 


. Sketch the set X =[-1,3] x [0,2] on the plane R?. On separate drawings, shade 


in the sets X and Xn ({-2,4] x [-1,3)). 


. Sketch the set X = {(x,y)€ R?:1<x?+y? <4} on the plane R?. On a separate 


drawing, shade in the set X. 


. Sketch the set X = {(x,y)€R?: y<x?} on R?. Shade in the set X. 
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1.7 Venn Diagrams 


In thinking about sets, it is sometimes helpful to draw informal, schematic 
diagrams of them. In doing this we often represent a set with a circle 
(or oval), which we regard as enclosing all the elements of the set. Such 
diagrams can illustrate how sets combine using various operations. For 
example, Figures 1.7(a—c) show two sets A and B which overlap in a 
middle region. The sets AUB, AnB and A-B are shaded. Such graphical 
representations of sets are called Venn diagrams, after their inventor, 
British logician John Venn, 1834-1923. 


A BA BA B 
(a) (b) (c) 


Figure 1.7. Venn diagrams for two sets 


Though you are not likely to draw Venn diagrams as a part of a proof 
of any theorem, you will probably find them to be useful “scratch work” 
devices that help you to understand how sets combine, and to develop 
strategies for proving certain theorems or solving certain problems. The 
remainder of this section uses Venn diagrams to explore how three sets 
can be combined using U and n. 

Let’s begin with the set AUBUC. Our definitions suggest this should 
consist of all elements which are in one or more of the sets A, B and C. 
Figure 1.8(a) shows a Venn diagram for this. Similarly, we think of ANBNC 
as all elements common to each of A, B and C, so in Figure 1.8(b) the 
region belonging to all three sets is shaded. 


A B A B 
AUBUC ANBnC 


(a) (b) 


Figure 1.8. Venn diagrams for three sets 
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We can also think of AN BNC as the two-step operation (AN B)nC. In 
this expression the set AnB is represented by the region common to both 
A and B, and when we intersect this with C we get Figure 1.8(b). This is a 
visual representation of the fact that AN BNC = (AnB)nC. Similarly, we 
have ANBnNC= An(BnC). Likewise, AUBUC = (AUB)UC= AU(BUC). 

Notice that in these examples, where the expression either contains 
only the symbol u or only the symbol n, the placement of the parentheses 
is irrelevant, so we are free to drop them. It is analogous to the situations 
in algebra involving expressions (a+ b)+c= a+(b+c) or (a-b)-c=a-(b-c). 
We tend to drop the parentheses and write simply a+b+c or a-b-c. By 
contrast, in an expression like (a+b)-c the parentheses are absolutely 
essential because (a+ b)-c and a+(b-c) are generally not equal. 

Now let’s use Venn diagrams to help us understand the expressions 
(AUB)NC and AU(BNC), which use a mix of U and n. Figure 1.9 shows 
how to draw a Venn diagram for (AUB)NC. In the drawing on the left, the 
set AUB is shaded with horizontal lines, while C is shaded with vertical 
lines. Thus the set (A UB)NC is represented by the cross-hatched region 
where AUB and C overlap. The superfluous shadings are omitted in the 
drawing on the right showing the set (AUB)NC. 


pan 


wy 
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Figure 1.9. How to make a Venn diagram for (AUB)NC 


Now think about AU(BnC). In Figure 1.10 the set A is shaded with 
horizontal lines, and BNC is shaded with vertical lines. The union AU(BNC) 
is represented by the totality of all shaded regions, as shown on the right. 


C 
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Figure 1.10. How to make a Venn diagram for AU(BNC) 
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Compare the diagrams for (AUB)NC and AU(BNC) in Figures 1.9 and 
1.10. The fact that the diagrams are different indicates that (AUB)NC # 
AU(BnC) in general. Thus an expression such as AUBNC is absolutely 
meaningless because we can’t tell whether it means (AUB)NC or AU(BNC). 
In summary, Venn diagrams have helped us understand the following. 


Important Points: 


e If an expression involving sets uses only U, then parentheses are optional. 
e If an expression involving sets uses only n, then parentheses are optional. 
e If an expression uses both U and n, then parentheses are essential. 


In the next section we will study types of expressions that use only U 
or only n. These expressions will not require the use of parentheses. 


Exercises for Section 1.7 

. Draw a Venn diagram for A. 

Draw a Venn diagram for B-A. 

Draw a Venn diagram for (A—B)NC. 

. Draw a Venn diagram for (AUB)-C. 

. Draw Venn diagrams for AU(BNC) and (AUB)n(AUC). Based on your drawings, 

do you think AU(BNC) = (AUB)n(AUC)? 

6. Draw Venn diagrams for An(BUC) and (AnB)U(AnC). Based on your drawings, 
do you think An(BUC) = (AnB)U(AnC)? 

7. Suppose sets A and B are in a universal set U. Draw Venn diagrams for . AnB 
and AUB. Based on your drawings, do you think it’s true that ANB = AUB? 


8. Suppose sets A and B are in a universal set U. Draw Venn diagrams for AUB 
and AnB. Based on your drawings, do you think it’s true that AUB = AnB? 


9. Draw a Venn diagram for (AnB)-C. 
10. Draw a Venn diagram for (A—B)UC. 


Following are Venn diagrams for expressions involving sets A,B and C. Write the 
corresponding expression. 


C C C C 
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1.8 Indexed Sets 


When a mathematical problem involves lots of sets, it is often convenient to 
keep track of them by using subscripts (also called indices). Thus instead 
of denoting three sets as A,B and C, we might instead write them as Aj, A2 
and As. These are called indexed sets. 

Although we defined union and intersection to be operations that 
combine two sets, you by now have no difficulty forming unions and 
intersections of three or more sets. (For instance, in the previous section 
we drew Venn diagrams for the intersection and union of three sets.) 
But let’s take a moment to write down careful definitions. Given sets 
Aj,Ag9,...,An, the set Ay VUAgUA3U:---UA, consists of everything that is 
in at least one of the sets A;. Likewise Ay NA2g:nA3gn-:-NA, consists of 
everything that is common to all of the sets A;. Here is a careful definition. 


Definition 1.7 Suppose A;,Ao,...,A, are sets. Then 


A,UAQgUA3U::-UA, = {x:x€A; for at least one set A;, for 1<i<n}, 
AinAgnAgn:::NAn = {x:xe€A; for every set Aj, for 1<isn}. 


But if the number n of sets is large, these expressions can get messy. 

To overcome this, we now develop some notation that is akin to sigma 
notation. You already know that sigma notation is a convenient symbolism 
for expressing sums of many numbers. Given numbers a1,q2,q3,...,Qn, 
then 

n 

ya: =A, +aAgtagt::'+An. 

i=l 


Even if the list of numbers is infinite, the sum 
CO 
)\ aj =a, +agtagt--t+ajt-. 
i=l 


is often still meaningful. The notation we are about to introduce is very 
similar to this. Given sets A1,A2,A3, ..., An, we define 


n n 
LJ A; =A1UAgUAg3U-+-UA, and ()Ai=Ar1nAgnAgn-:-NAn. 
i=l i=1 


Example 1.9 Suppose Aj = {0,2,5}, Ag = {1,2,5} and Ag = {2,5,7}. Then 


Aj =A,;nNAg nAg = {2,5}. 


3 
LJ) A; =A1UAgUAsz = {0,1,2,5, 7} and 
j=1 


3 
i=1 l= 
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This notation is also used when the list of sets A1,A9,A3, ... is infinite: 


(JA; =A1UAgUA3uU-- = {x:xeA; for at least one set A; with 1< i}. 
i=1 

()Ai=A1nAgnAgn::- = {x:xeA; for every set A; with 1< i}. 

i=1 


Example 1.10 This example involves the following infinite list of sets. 


A1={-1,0,1}, Ao = {-2,0,2}, Az3 = {-3,0,3}, veins) Ai ={-i,0,i}, ri 


Observe that |) A;=Z, and [{)A; = {0}. 
i=l i=l 


Here is a useful twist on our new notation. We can write 


3 
[Ae stil Ae, 


i=l ie{1,2,3} 


as this takes the union of the sets A; for i=1,2,3. Likewise: 


~)e 
ig 
HT 


() Ai 


i=1 i€{1,2,3} 
CO 

UA: = UAi 
i=1 ieN 

CO 

(\Ai = (Ai 
i=1 ieN 


Here we are taking the union or intersection of a collection of sets A; 
where i is an element of some set, be it {1,2,3} or N. In general, the way 
this works is that we will have a collection of sets A; for ic J, where I is 
the set of possible subscripts. The set J is called an index set. 

It is important to realize that the set J need not even consist of integers. 
(We could subscript with letters or real numbers, etc.) Since we are 
programmed to think of i as an integer, let’s make a slight notational 
change: We use a, not i, to stand for an element of J. Thus we are dealing 
with a collection of sets Ag for ae J. This leads to the following definition. 


Definition 1.8 If we have a set A, for every a in some index set J, then 


[JAq = {x : x€ Ag for at least one set Aq with ac J} 


ael 


()Aaq = {x : x€ Ag for every set Ay with ae J}. 


ael 
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Example 1.11 Here the sets Aq will be subsets of R?. Let I = [0,2] = 
{x€R:0<x <2}. For each number ae€/, let Ag = {(x,a):x€R,1<x<2}. For 
instance, given a=1eI the set A; = {(x,1):x€R,1<.x <2} is a horizontal 
line segment one unit above the x-axis and stretching between x =1 and 
x =2, as shown in Figure 1.11(a). Likewise A jg = {(«, V2):x€R,1<x<2h is 
a horizontal line segment V2 units above the x-axis and stretching between 
x=1andx=2. A few other of the Ag are shown in Figure 1.11(a), but they 
can’t all be drawn because there is one A, for each of the infinitely many 
numbers aé¢ [0,2]. The totality of them covers the shaded region in Figure 
1.11(b), so this region is the union of all the Ay. Since the shaded region 
is the set {(x,y)€R?:1<x<2,0<y<2} =[1,2]x [0,2], it follows that 


LJ Aq =[1,21 x [0,2]. 
ae[0,2] 


Likewise, since there is no point (x,y) that is in every set Ag, we have 


[ |): Awe. 
ae[0,2] 
y 
2 
Avs 
1 U Aa 
ae[0,2] 
Ao.s 
Ao.25 
x x 
1 2 
(a) (b) 


Figure 1.11. The union of an indexed collection of sets 


One final comment. Observe that Aq =[1,2]x {a}, so the above expres- 
sions can be written as 


LJ [1,2]x {a}=[1,2]x [0,2] and (| Tj 2lx{a}=9, 
ae[0,2] ae[0,2] 
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Exercises for Section 1.8 
1. Suppose A, = {a,b,d,e,g,f}, Ag ={a,b,c,d}, As = {b,d,a} and Aq = {a,b,h}. 
4 4 
(a) UAi= (b) (]Ai= 
fi i=1 


Ai = {0,2,4,8, 10,12, 14, 16, 18, 20,22, 24}, 
2. Suppose 4 Ag {0,3,6,9, 12,15, 18, 21,24}, 


Il 


A3 = 40,4,8,12,16,20,24}. 
3 3 
(a) UAi= (b) [)Ai= 
i=l i=l 
3. For each néN, let A, = {0,1,2,3,...,n}. 
(a) UAi= (b) ()Ai= 
ieN ieN 
4. For each neéN, let A, = {-2n,0,2n}. 
(a) UAi= (b) ()Ai= 
ieN ieN 
5. (a) Uli,i+ l= (b) ()li,i+1]= 
ieN ieN 
6. (a) (J[0,i+1]= (b) ()[0,i+1]= 
ieN ieN 
7. (a) URxl[i,i+l= (b) ()Rxli,i+1]= 
ieN ieN 
8. (a) LU {a}x[0,1]= (b) [(){a}x[0,1]= 
acR acR 
9. (a) LU X= (b) (] X= 
Xe AN) Xe PAIN) 
10. (a) U [x,11x[0,x71= (b)  () Ix, 1] [0,x7] = 
x€[0,1] xe[0,1] 


11. Is () Aa S LU Aa always true for any collection of sets Aq with index set I? 


ael ael 


12. If () Aa =U Aa, what do you think can be said about the relationships between 
ael ael 


the sets Aq? 


13. If J#¢@ and JC], does it follow that |) AaS LJ) Aa? What about (] Ag [) Aa? 


acd ael acd ael 


14. If J4#@ and JC], does it follow that (] Aa [] Aq? Explain. 


ael acd 
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1.9 Sets that Are Number Systems 


In practice, the sets we tend to be most interested in often have special 
properties and structures. For example, the sets Z, Q and R are familiar 
number systems: Given such a set, any two of its elements can be added 
(or multiplied, etc.) together to produce another element in the set. These 
operations obey the familiar commutative, associative and distributive 
properties that we all have dealt with for years. Such properties lead to 
the standard algebraic techniques for solving equations. Even though we 
are concerned with the idea of proof, we will not find it necessary to define, 
prove or verify such properties and techniques; we will accept them as the 
ground rules upon which our further deductions are based. 

We also accept as fact the natural ordering of the elements of N,Z,Q 
and R, so that (for example) the meaning of “5 <7” is understood and does 
not need to be justified or explained. Similarly, if x < y and a 40, we know 
that ax <ay or ax =ay, depending on whether a is positive or negative. 

Another thing that our ingrained understanding of the ordering of 
numbers tells us is that any non-empty subset of N has a smallest element. 
In other words, if ACN and A #4Q, then there is an element xp ¢ A that is 
smaller than every other element of A. (To find it, start at 1, then move 
in increments to 2, 3, 4, etc., until you hit a number xq € A; this is the 
smallest element of A.) Similarly, given an integer b, any non-empty subset 
Ac{b,b+1,b+2,6+3,...} has a smallest element. This fact is sometimes 
called the well-ordering principle. There is no need to remember this 
term, but do be aware that we will use this simple, intuitive idea often in 
proofs, usually without a second thought. 

The well-ordering principle seems innocent enough, but it actually says 
something very fundamental and special about the positive integers N. 
In fact, the corresponding statement about the positive real numbers 
is false: The subset A = {+:n€N} of the positive reals has no smallest 
element because for any xo = 1 € A that we might pick, there is always a 
smaller element 4, € A. 

One consequence of the well-ordering principle (as we will see below) 
is the familiar fact that any integer a can be divided by a non-zero integer 
b, resulting in a quotient g and remainder r. For example, b = 3 goes 
into a=17 q=5 times with remainder r= 2. In symbols, 17=4-5+2, or 
a=qb+r. This significant fact is called the division algorithm. 


Fact 1.5 (The Division Algorithm) Given integers a and b with b>0, 
there exist integers q and r for which a=qb+r and 0<r<b. 
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Although there is no harm in accepting the division algorithm without 
proof, note that it does follow from the well-ordering principle. Here’s how: 
Given integers a,b with b >0, form the set 


A={a-xb:x€Z,0<a-xb}¢ {0,1,2,3,...}. 


(For example, if a=17 and b= then A = {2,5,8,11,14,17,20,...} is the set 
of positive integers obtained by adding multiples of 3 to 17. Notice that 
the remainder r = 2 of 17+3 is the smallest element of this set.) In general, 
let r be the smallest element of the set A = {a—xb:x¢€Z,0<a-xb}. Then 
r=a-qb for some x=qéZ, soa=qb+r. Moreover, 0<r<b, as follows. 
The fact that re AC {0,1,2,3...$ implies 0< r. In addition, it cannot 
happen that r=b: If this were the case, then the non-negative number 
r—b=(a-qb)—b =a-(q+1)b having form a—xb would be a smaller element 
of A than r, and r was explicitly chosen as the smallest element of A. 
Since it is not the case that r= 0b, it must be that r<b. Therefore 0<r<b. 
We’ve now produced a q and an r for which a= qb+r and 0<r<b. 


1.10 Russell’s Paradox 


This section contains some background information that may be interesting, 
but is not used in the remainder of the book. 

The philosopher and mathematician Bertrand Russell (1872-1970) 
did groundbreaking work on the theory of sets and the foundations of 
mathematics. He was probably among the first to understand how the 
misuse of sets can lead to bizarre and paradoxical situations. He is famous 
for an idea that has come to be known as Russell’s paradox. 

Russell’s paradox involves the following set of sets: 


A={X: Xisaset and X¢X }. (1.1) 


Most sets that we can think of are in A. For example, {1,2} ¢ A because 
{1,2} is a set and {1,2} ¢ {1,2}. Likewise, d¢ A because ¢ is a set and d¢@. 

Is there a set that is not in A? Consider X = {{{{...}}}}. Think of X 
as a box containing a box, containing a box, containing a box, and so on, 
forever. Or a set of Russian dolls, nested one inside the other, endlessly. 
The curious thing about X is that it has just one element, namely X itself: 


X=LU HT 


Thus X « X. As X does not satisfy X ¢ X, Equation (1.1) says X ¢ A. 
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Russell’s paradox arises from the question “Js A an element of A?” 

For a set X, Equation (1.1) says X € A means the same thing as X ¢ X. 
So for X = A, the previous line says A¢ A means the same thing as A¢ A. 
Conclusion: if A ¢ A is true, then it is false; if A¢ A is false, then it is true. 
This is Russell’s paradox. 

Initially Russell’s paradox sparked a crisis among mathematicians. 
How could a mathematical statement be both true and false? This seemed 
to be in opposition to the very essence of mathematics. 

The paradox instigated a very careful examination of set theory and 
an evaluation of what can and cannot be regarded as a set. Eventually 
mathematicians settled upon nine axioms for set theory—the so-called 
Zermelo-Fraenkel axioms. One of these axioms is the well-ordering 
principle of the previous section. Another, the axiom of foundation, states 
that any non-empty set X has an element x for which X nx=4@. This rules 
out such circularly defined “sets” as X = {X} introduced above. If we adhere 
to these axioms, then situations like Russell’s paradox disappear. Most 
mathematicians accept all this on faith and happily ignore the Zermelo- 
Fraenkel axioms. Paradoxes like Russell’s do not tend to come up in 
everyday mathematics—you have to go out of your way to construct them. 

Still, Russell’s paradox reminds us of that precision of thought and 
language is an important part of doing mathematics. The next chapter 
deals with the topic of logic, a codification of thought and language. 


Additional Reading on Sets. For a lively account of Bertrand Russell’s 
life and work (including his paradox), see the graphic novel Logicomix: An 
Epic Search For Truth, by Apostolos Doxiadis and Christos Papadimitriou. 
Also see cartoonist Jessica Hagy’s online strip Indexed—it is based largely 
on Venn diagrams. 


CHAPTER 2 


Logic 


lee is a systematic way of thinking that allows us to deduce new infor- 
mation from old information and to parse the meanings of sentences. 
You use logic informally in everyday life and certainly also in doing mathe- 
matics. For example, suppose you are working with a certain circle, call it 
“Circle X,” and you have available the following two pieces of information. 


1. Circle X has radius equal to 3. 
2. If any circle has radius r, then its area is mr? square units. 


You have no trouble putting these two facts together to get: 
3. Circle X has area 92 square units. 


In doing this you are using logic to combine existing information to 
produce new information. Since a major objective in mathematics is to 
deduce new information, logic must play a fundamental role. This chapter 
is intended to give you a sufficient mastery of logic. 

It is important to realize that logic is a process of deducing information 
correctly, not just deducing correct information. For example, suppose we 
were mistaken and Circle X actually had a radius of 4, not 3. Let’s look at 
our exact same argument again. 


1. Circle X has radius equal to 3. 
2. If any circle has radius r, then its area is mr? square units. 


3. Circle X has area 92 square units. 


The sentence “Circle X has radius equal to 3.” is now untrue, and so is our 
conclusion “Circle X has area 9x square units.” But the logic is perfectly 
correct; the information was combined correctly, even if some of it was 
false. This distinction between correct logic and correct information is 
significant because it is often important to follow the consequences of an 
incorrect assumption. Ideally, we want both our logic and our information 
to be correct, but the point is that they are different things. 
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In proving theorems, we apply logic to information that is considered 
obviously true (such as “Any two points determine exactly one line.”) or is 
already known to be true (e.g., the Pythagorean theorem). If our logic is 
correct, then anything we deduce from such information will also be true 
(or at least as true as the “obviously true” information we began with). 


2.1 Statements 


The study of logic begins with statements. A statement is a sentence 
or a mathematical expression that is either definitely true or definitely 
false. You can think of statements as pieces of information that are either 
correct or incorrect. Thus statements are pieces of information that we 
might apply logic to in order to produce other pieces of information (which 
are also statements). 


Example 2.1 Here are some examples of statements. They are all true. 
If a circle has radius r, then its area is mr? square units. 
Every even number is divisible by 2. 
2EZ 
V2¢Z 
NEZ 
The set {0,1,2} has three elements. 


Some right triangles are isosceles. 


Example 2.2 Here are some additional statements. They are all false. 
All right triangles are isosceles. 
5=2 
V2¢R 
ZN 
{0,1,2}nN=@ 


Example 2.3 Here we pair sentences or expressions that are not state- 
ments with similar expressions that are statements. 


NOT Statements: Statements: 

Add 5 to both sides. Adding 5 to both sides of x -—5 = 37 gives x = 42. 
Z 42eZ 

42 42 is not a number. 

What is the solution of 2x = 84? | The solution of 2x = 84 is 42. 
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Example 2.4 We will often use the letters P, Q, R and S to stand for 
specific statements. When more letters are needed we can use subscripts. 
Here are more statements, designated with letters. You decide which of 
them are true and which are false. 


P: For every integer n >1, the number 2” —-1 is prime. 
Q : Every polynomial of degree n has at most n roots. 
R: The function f(x) =x? is continuous. 

S1:Z2¢@ 

S2:{0,-1,-2}nN=¢ 


Designating statements with letters (as was done above) is a very useful 
shorthand. In discussing a particular statement, such as “The function 
f(x) = x? is continuous,” it is convenient to just refer to it as R to avoid 
having to write or say it many times. 

Statements can contain variables. Here is an example. 


P: If an integer x is a multiple of 6, then x is even. 


This is a sentence that is true. (All multiples of 6 are even, so no matter 
which multiple of 6 the integer x happens to be, it is even.) Since the 
sentence P is definitely true, it is a statement. When a sentence or 
statement P contains a variable such as x, we sometimes denote it as P(x) 
to indicate that it is saying something about x. Thus the above statement 
can be denoted as 


P(x): If an integer x is a multiple of 6, then x is even. 


A statement or sentence involving two variables might be denoted 
P(x,y), and so on. 

It is quite possible for a sentence containing variables to not be a 
statement. Consider the following example. 


Q(x): The integer x is even. 


Is this a statement? Whether it is true or false depends on just which 
integer x is. It is true if x= 4 and false if x =7, etc. But without any 
stipulations on the value of x it is impossible to say whether Q(x) is true or 
false. Since it is neither definitely true nor definitely false, Q(x) cannot be 
a statement. A sentence such as this, whose truth depends on the value 
of one or more variables, is called an open sentence. The variables in 
an open sentence (or statement) can represent any type of entity, not just 
numbers. Here is an open sentence where the variables are functions: 
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R(f,g): The function f is the derivative of the function g. 


This open sentence is true if f(x) =2x and g(x) =x”. It is false if f(x) =x? 
and g(x) =x”, etc. We point out that a sentence such as R(f,g) (that 
involves variables) can be denoted either as R(f,g) or just R. We use the 
expression R(f,g) when we want to emphasize that the sentence involves 
variables. 

We will have more to say about open sentences later, but for now let’s 
return to statements. 

Statements are everywhere in mathematics. Any result or theorem 
that has been proved true is a statement. The quadratic formula and the 
Pythagorean theorem are both statements: 


—b+vVb2-—4ac 


P: The solutions of the equation ax?+bx+c=0 are x= Da 


Q@: Ifa right triangle has legs of lengths a and b and hypotenuse of 
length c, then a? +b? =c?. 


Here is a very famous statement, so famous, in fact, that it has a name. 
It is called Fermat’s last theorem after Pierre Fermat, a seventeenth- 
century French mathematician who scribbled it in the margin of a notebook. 


R: For all numbers a,b,c,n € N with n > 2, it is the case that a” +b” 4c”. 


Fermat believed this statement was true. He noted that he could prove 
it was true, except his notebook’s margin was too narrow to contain his 
proof. It is doubtful that he really had a correct proof in mind, for after his 
death generations of brilliant mathematicians tried unsuccessfully to prove 
that his statement was true (or false). Finally, in 1993, Andrew Wiles of 
Princeton University announced that he had devised a proof. Wiles had 
worked on the problem for over seven years, and his proof runs through 
hundreds of pages. The moral of this story is that some true statements 
are not obviously true. 

Here is another statement famous enough to be named. It was first 
posed in the eighteenth century by the German mathematician Christian 
Goldbach, and thus is called the Goldbach conjecture: 


S: Every even integer greater than 2 is a sum of two prime numbers. 


You must agree that S is either true or false. It appears to be true, because 
when you examine even numbers that are bigger than 2, they seem to 
be sums of two primes: 4=2+2, 6=3+8, 8=3+5, 10=5+5, 12=5+7, 
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100 = 17+ 83 and so on. But that’s not to say there isn’t some large even 
number that’s not the sum of two primes. If such a number exists, then S 
is false. The thing is, in the over 260 years since Goldbach first posed this 
problem, no one has been able to determine whether it’s true or false. But 
since it is clearly either true or false, S is a statement. 

This book is about the methods that can be used to prove that S (or 
any other statement) is true or false. To prove that a statement is true, 
we start with obvious statements (or other statements that have been 
proven true) and use logic to deduce more and more complex statements 
until finally we obtain a statement such as S. Of course some statements 
are more difficult to prove than others, and S appears to be notoriously 
difficult; we will concentrate on statements that are easier to prove. 

But the point is this: In proving that statements are true, we use logic 
to help us understand statements and to combine pieces of information 
to produce new pieces of information. In the next several sections we 
explore some standard ways that statements can be combined to form new 
statements, or broken down into simpler statements. 


Exercises for Section 2.1 


Decide whether or not the following are statements. In the case of a statement, 
say if it is true or false, if possible. 

. Every real number is an even integer. 

. Every even integer is a real number. 

If x and y are real numbers and 5x =5y, then x= y. 

Sets Z and N. 

. Sets Z and N are infinite. 

. Some sets are finite. 

The derivative of any polynomial of degree 5 is a polynomial of degree 6. 
« N¢ AN). 

- cos(x)=—-1 

. (Rx N)A(NxR)=NxN 

. The integer x is a multiple of 7. 


CON TD HAP wD e 


— mk eek 
wo = © 


. If the integer x is a multiple of 7, then it is divisible by 7. 


_ 
Go 


. Either x is a multiple of 7, or it is not. 


—_ 
_~ 


. Call me Ishmael. 


—_ 
oO 


. In the beginning, God created the heaven and the earth. 
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2.2 And, Or, Not 


The word “and” can be used to combine two statements to form a new 
statement. Consider for example the following sentence. 


R,: The number 2 is even and the number 3 is odd. 


We recognize this as a true statement, based on our common-sense under- 
standing of the meaning of the word “and.” Notice that R; is made up of 
two simpler statements: 


P.: The number 2 is even. 
Q: The number 38 is odd. 


These are joined together by the word “and” to form the more complex 
statement R,;. The statement R; asserts that P and Q are both true. Since 
both P and Q are in fact true, the statement R, is also true. 

Had one or both of P and Q been false, then R, would be false. For 
instance, each of the following statements is false. 


Ry: The number 1 is even and the number 3 is odd. 
R3: The number 2 is even and the number 4 is odd. 
R4: The number 3 is even and the number 2 is odd. 


From these examples we see that any two statements P and Q can 
be combined to form a new statement “P and Q.” In the spirit of using 
letters to denote statements, we now introduce the special symbol A to 
stand for the word “and.” Thus if P and Q are statements, PAQ stands 
for the statement “P and Q.” The statement P AQ is true if both P and Q 
are true; otherwise it is false. This is summarized in the following table, 
called a truth table. 


Sa) "ap Ny] 
Spm] oO 
Sy) a] Sy] > 


In this table, T stands for “True,” and F stands for “False.” (T and F are 
called truth values.) Each line lists one of the four possible combinations 
or truth values for P and Q, and the column headed by P AQ tells whether 
the statement P AQ is true or false in each case. 
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Statements can also be combined using the word “or.” Consider the 
following four statements. 


S;,: The number 2 is even or the number 3 is odd. 
So: The number 1 is even or the number 3 is odd. 
S3: The number 2 is even or the number 4 is odd. 
S4: The number 3 is even or the number 2 is odd. 


In mathematics, the assertion “P or Q” is always understood to mean that 
one or both of P and Q is true. Thus statements S1, So, S3 are all true, 
while S4 is false. The symbol v is used to stand for the word “or.” So if P 
and Q are statements, P vQ represents the statement “P or Q.” Here is 
the truth table. 


oe > = BI) 
Saya] o 
BYP AT < 


It is important to be aware that the meaning of “or” expressed in 
the above table differs from the way it is sometimes used in everyday 
conversation. For example, suppose a university official makes the following 
threat: 


You pay your tuition or you will be withdrawn from school. 


You understand that this means that either you pay your tuition or you 
will be withdrawn from school, but not both. In mathematics we never use 
the word “or” in such a sense. For us “or” means exactly what is stated 
in the table for v. Thus PvQ being true means one or both of P and Q 
is true. If we ever need to express the fact that exactly one of P and Q is 
true, we use one of the following constructions: 


P or Q, but not both. 
Either P or Q. 


If the university official were a mathematician, he might have qualified 
his statement in one of the following ways. 


Pay your tuition or you will be withdrawn from school, but not both. 


Either you pay your tuition or you will be withdrawn from school. 
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To conclude this section, we mention another way of obtaining new 
statements from old ones. Given any statement P, we can form the new 
statement “It is not true that P.” For example, consider the following 
statement. 


The number 2 is even. 


This statement is true. Now change it by inserting the words “It is not 
true that” at the beginning: 


It is not true that the number 2 is even. 


This new statement is false. 


For another example, starting with the false statement “2€ @,” we get 
the true statement “It is not true that 2€ @.” 


We use the symbol ~ to stand for the words “It’s not true that,” so 
~P means “It’s not true that P.” We often read ~ P simply as “not P.” 
Unlike A and v, which combine two statements, the symbol ~ just alters 
a single statement. Thus its truth table has just two lines, one for each 
possible truth value of P. 


P||~P 
T || F 
Fl T 


The statement ~ P is called the negation of P. The negation of a 
specific statement can be expressed in numerous ways. Consider 


P: The number 2 is even. 
Here are several ways of expressing its negation. 


~P:It’s not true that the number 2 is even. 
~P-:Itis false that the number 2 is even. 
~P:The number 2 is not even. 


In this section we’ve learned how to combine or modify statements with 
the operations A, v and ~. Of course we can also apply these operations 
to open sentences or a mixture of open sentences and statements. For 
example, (x is an even integer) A(3 is an odd integer) is an open sentence 
that is a combination of an open sentence and a statement. 
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Exercises for Section 2.2 

Express each statement or open sentence in one of the forms PAQ, PvQ, or ~P. 
Be sure to also state exactly what statements P and Q stand for. 

. The number 8 is both even and a power of 2. 

The matrix A is not invertible. 

xy 4,.x<y 5. y=x 

. There is a quiz scheduled for Wednesday or Friday. 

. The number x equals zero, but the number y does not. 

. At least one of the numbers x and y equals 0. 

. xe A-B 10. xe AUB 11. Ac {Xe AN): |X| <oo} 
12. Happy families are all alike, but each unhappy family is unhappy in its own 
way. (Leo Tolstoy, Anna Karenina) 

13. Human beings want to be good, but not too good, and not quite all the time. 
(George Orwell) 


14. A man should look for what is, and not for what he thinks should be. 
(Albert Einstein) 


CHNDWSNY 


2.3 Conditional Statements 


There is yet another way to combine two statements. Suppose we have in 
mind a specific integer a. Consider the following statement about a. 


R: If the integer a is a multiple of 6, then a is divisible by 2. 


We immediately spot this as a true statement based on our knowledge of 
integers and the meanings of the words “if” and “then.” If integer a is a 
multiple of 6, then a is even, so therefore a is divisible by 2. Notice that R 
is built up from two simpler statements: 


P: The integer a is a multiple of 6. 
Q: The integer a is divisible by 2. 
R: If P, then Q. 


In general, given any two statements P and Q whatsoever, we can form 
the new statement “If P, then Q.” This is written symbolically as P>Q 
which we read as “If P, then Q,” or “P implies Q.” Like \ and v, the symbol 
=> has a very specific meaning. When we assert that the statement P > Q 
is true, we mean that if P is true then Q must also be true. (In other words 
we mean that the condition P being true forces Q to be true.) A statement 
of form P > Q is called a conditional statement because it means Q will 
be true under the condition that P is true. 
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You can think of P > Q as being a promise that whenever P is true, Q 
will be true also. There is only one way this promise can be broken (i.e. 
be false) and that is if P is true but Q is false. Thus the truth table for 
the promise P > Q is as follows: 


Aya} A]) A || 
SYP] © 
ARPS] Y 


Perhaps you are bothered by the fact that P > Q is true in the last two 
lines of this table. Here’s an example to convince you that the table is 
correct. Suppose your professor makes the following promise: 


If you pass the final exam, then you will pass the course. 
Your professor is making the promise 
(You pass the exam) => (You pass the course). 


Under what circumstances did she lie? There are four possible scenarios, 
depending on whether or not you passed the exam and whether or not you 
passed the course. These scenarios are tallied in the following table. 


You pass exam | You pass course || (You pass exam) => (You pass course) 
T T T 
T F F 
F T T 
F F T 


The first line describes the scenario where you pass the exam and you 
pass the course. Clearly the professor kept her promise, so we put a T in 
the third column to indicate that she told the truth. In the second line, 
you passed the exam, but your professor gave you a failing grade in the 
course. In this case she broke her promise, and the F in the third column 
indicates that what she said was untrue. 

Now consider the third row. In this scenario you failed the exam but 
still passed the course. How could that happen? Maybe your professor felt 
sorry for you. But that doesn’t make her a liar. Her only promise was that 
if you passed the exam then you would pass the course. She did not say 


Conditional Statements Al 


passing the exam was the only way to pass the course. Since she didn’t 
lie, then she told the truth, so there is a T in the third column. 

Finally look at the fourth row. In that scenario you failed the exam 
and you failed the course. Your professor did not lie; she did exactly what 
she said she would do. Hence the T in the third column. 

In mathematics, whenever we encounter the construction “If P, then 
Q” it means exactly what the truth table for > expresses. But of course 
there are other grammatical constructions that also mean P>Q. Here is 
a summary of the main ones. 


If P, then Q. 

Q if P. 

Q whenever P. 

Q, provided that P. 

Whenever P, then also Q. 

P is a sufficient condition for Q. 
For Q, it is sufficient that P. 

Q@ is a necessary condition for P. 
For P, it is necessary that Q. 

P only if Q. 


P>Q 


These can all be used in the place of (and mean exactly the same thing as) 
“If P, then Q.” You should analyze the meaning of each one and convince 
yourself that it captures the meaning of P>Q. For example, P > Q means 
the condition of P being true is enough (i.e., sufficient) to make Q true; 
hence “P is a sufficient condition for Q.” 

The wording can be tricky. Often an everyday situation involving a 
conditional statement can help clarify it. For example, consider your 
professor’s promise: 


(You pass the exam) => (You pass the course) 


This means that your passing the exam is a sufficient (though perhaps 
not necessary) condition for your passing the course. Thus your professor 
might just as well have phrased her promise in one of the following ways. 


Passing the exam is a sufficient condition for passing the course. 
For you to pass the course, it is sufficient that you pass the exam. 
However, when we want to say “If P, then Q” in everyday conversation, 
we do not normally express this as “Q is a necessary condition for P” or 


“P only if Q.” But such constructions are not uncommon in mathematics. 
To understand why they make sense, notice that P > Q being true means 
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that it’s impossible that P is true but Q is false, so in order for P to be 
true it is necessary that Q is true; hence “Q is a necessary condition for 
P.” And this means that P can only be true if Q is true, i.e., “P only if Q.” 


Exercises for Section 2.3 


Without changing their meanings, convert each of the following sentences into a 
sentence having the form “If P, then Q.” 

. A matrix is invertible provided that its determinant is not zero. 

. For a function to be continuous, it is sufficient that it is differentiable. 

For a function to be integrable, it is necessary that it is continuous. 

. A function is rational if it is a polynomial. 

An integer is divisible by 8 only if it is divisible by 4. 

Whenever a surface has only one side, it is non-orientable. 

. Aseries converges whenever it converges absolutely. 


. A geometric series with ratio r converges if |r| <1. 


CHN AARON 


. A function is integrable provided the function is continuous. 


_ 
i=) 


. The discriminant is negative only if the quadratic equation has no real solutions. 


j—_ 
—_ 


. You fail only if you stop writing. (Ray Bradbury) 


_ 
bo 


. People will generally accept facts as truth only if the facts agree with what 
they already believe. (Andy Rooney) 
13. Whenever people agree with me I feel I must be wrong. (Oscar Wilde) 


2.4 Biconditional Statements 


It is important to understand that P > Q is not the same as Q=> P. To see 
why, suppose that a is some integer and consider the statements 


(a isa multiple of 6) => (a is divisible by 2), 
(a is divisible by 2) => (ais a multiple of 6). 


The first statement asserts that if a is a multiple of 6 then a is divisible 
by 2. This is clearly true, for any multiple of 6 is even and therefore 
divisible by 2. The second statement asserts that if a is divisible by 2 then 
it is a multiple of 6. This is not necessarily true, for a = 4 (for instance) is 
divisible by 2, yet not a multiple of 6. Therefore the meanings of P > Q and 
Q => P are in general quite different. The conditional statement Q = P is 
called the converse of P > Q, so a conditional statement and its converse 
express entirely different things. 
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But sometimes, if P and Q are just the right statements, it can happen 
that P>Q and Q@ =P are both necessarily true. For example, consider 
the statements 


(ais even) = (a is divisible by 2), 


(a is divisible by 2) => (ais even). 


No matter what value a has, both of these statements are true. Since both 
P=>Q and Q=>P are true, it follows that (P > Q)A(Q = P) is true. 

We now introduce a new symbol < to express the meaning of the 
statement (P > Q)A(Q => P). The expression P = Q is understood to have 
exactly the same meaning as (P>Q)A(Q =P). According to the previous 
section, Q > P is read as “P if Q,” and P=>Q can be read as “P only if Q.” 
Therefore we pronounce P = Q as “P if and only if Q.” For example, given 
an integer a, we have the true statement 


(a is even) } (a is divisible by 2), 


which we can read as “Integer a is even if and only if a is divisible by 2.” 

The truth table for = is shown below. Notice that in the first and last 
rows, both P>Q and Q =P are true (according to the truth table for =>), 
so (P > Q)A(Q = P) is true, and hence P © Q is true. However, in the 
middle two rows one of P> Q or Q = P is false, so (P > Q)A(Q = P) is false, 
making P = Q false. 


PI|Q|P<eQ 
T|T T 
T|F F 
F\T F 
F\F T 


Compare the statement R :(a is even) = (a is divisible by 2) with this 
truth table. If a is even then the two statements on either side of © 
are true, so according to the table R is true. If a is odd then the two 
statements on either side of = are false, and again according to the table 
R is true. Thus R is true no matter what value a has. In general, P= Q 
being true means P and Q are both true or both false. 

Not surprisingly, there are many ways of saying P = @ in English. The 
following constructions all mean P <Q: 
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P if and only if Q. 

P is a necessary and sufficient condition for Q. 
For P it is necessary and sufficient that Q. 

If P, then Q, and conversely. 


P<sQ 


The first three of these just combine constructions from the previous 
section to express that P > Q and Q = P. In the last one, the words “..and 
conversely” mean that in addition to “If P, then Q” being true, the converse 
statement “If Q, then P” is also true. 


Exercises for Section 2.4 

Without changing their meanings, convert each of the following sentences into a 
sentence having the form “P if and only if Q.” 

1. For matrix A to be invertible, it is necessary and sufficient that det(A) 4 0. 

. If a function has a constant derivative then it is linear, and conversely. 

. If xy=0 then x=0 or y=0, and conversely. 

. IfaeQ then 5a€ Q, and if 5a€ Q then aeEQ. 


. For an occurrence to become an adventure, it is necessary and sufficient for 
one to recount it. (Jean-Paul Sartre) 


a, WwW WD 


2.5 Truth Tables for Statements 


You should now know the truth tables for ~, v, ~, > and «. They should 
be internalized as well as memorized. You must understand the symbols 
thoroughly, for we now combine them to form more complex statements. 

For example, suppose we want to convey that one or the other of P and 
Q is true but they are not both true. No single symbol expresses this, but 
we could combine them as 


(PvQ)A ~(P AQ), 


which literally means: 


P or Q is true, and it is not the case that both P and Q are true. 


This statement will be true or false depending on the truth values of P 
and Q. In fact we can make a truth table for the entire statement. Begin 
as usual by listing the possible true/false combinations of P and Q on four 
lines. The statement (P vQ)A ~ (P AQ) contains the individual statements 
(PvQ) and (P AQ), so we next tally their truth values in the third and 
fourth columns. The fifth column lists values for ~(P AQ), and these 
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are just the opposites of the corresponding entries in the fourth column. 
Finally, combining the third and fifth columns with ”, we get the values 
for (PV Q)A ~(P AQ) in the sixth column. 


P|Q| @vQ) | (PAQ) | ~PAQ) (PVQ)A ~(P AQ) 
T|T vi T F F 
T\|F T F T T 
F|\T T. F T T 
F\F F F T F 


This truth table tells us that (PVQ)A ~ (P AQ) is true precisely when 
one but not both of P and Q are true, so it has the meaning we intended. 
(Notice that the middle three columns of our truth table are just “helper 
columns” and are not necessary parts of the table. In writing truth tables, 
you may choose to omit such columns if you are confident about your work.) 

For another example, consider the following familiar statement con- 
cerning two real numbers x and y: 


The product xy equals zero if and only if x=0 or y=0. 


This can be modeled as (xy=0)@(x=0 v y=0). If we introduce letters 
P,Q and R for the statements xy =0, x=0 and y=0, it becomes P © (QVR). 
Notice that the parentheses are necessary here, for without them we 
wouldn’t know whether to read the statement as P = (QvVR) or (P©Q)vVR. 

Making a truth table for P = (QvVR) entails a line for each T/F combina- 
tion for the three statements P, Q and R. The eight possible combinations 
are tallied in the first three columns of the following table. 


P\Q|R| Q@vR | Pe@QvR) 
T|T|T T T 
T\|T|F T T 
T|FI/T T T 
T|F\|F| F F 
F(/T|T T F 
F\T|F T F 
F\/F|T T F 
F\F|F| F T 


We fill in the fourth column using our knowledge of the truth table 
for v. Finally the fifth column is filled in by combining the first and fourth 
columns with our understanding of the truth table for =. The resulting 
table gives the true/false values of P = (Q Vv R) for all values of P,Q and R. 
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Notice that when we plug in various values for x and y, the statements 
P:xy=0,Q:x=0 and R:y=0 have various truth values, but the statement 
P o(QVR) is always true. For example, if x =2 and y=3, then P,Q and R 
are all false. This scenario is described in the last row of the table, and 
there we see that P = (QvR) is true. Likewise if x=0 and y=7, then P 
and Q are true and R is false, a scenario described in the second line of 
the table, where again P = (Qv R) is true. There is a simple reason why 
P @(QVR) is true for any values of x and y: It is that P = (QV R) represents 
(xy =0)@(x=0 v y=0), which is a true mathematical statement. It is 
absolutely impossible for it to be false. 

This may make you wonder about the lines in the table where P = (QVR) 
is false. Why are they there? The reason is that P = (QvR) can also 
represent a false statement. To see how, imagine that at the end of the 
semester your professor makes the following promise. 


You pass the class if and only if you get an “A” on the final or you get 
a “B” on the final. 


This promise has the form P = (QvR), so its truth values are tabulated in 
the above table. Imagine it turned out that you got an “A” on the exam 
but failed the course. Then surely your professor lied to you. In fact, P is 
false, Q is true and R is false. This scenario is reflected in the sixth line 
of the table, and indeed P = (Q Vv R) is false (.e., it is a lie). 

The moral of this example is that people can lie, but true mathematical 
statements never lie. 

We close this section with a word about the use of parentheses. The 
symbol ~ is analogous to the minus sign in algebra. It negates the 
expression it precedes. Thus ~PvQ means (~ P)vQ, not ~(PvQ). In 
~(P vQ), the value of the entire expression P vQ is negated. 


Exercises for Section 2.5 


Write a truth table for the logical statements in problems 1-9: 


1. PvQ=>R) 4. ~(PvQ)v(~P) 7. (PA~P)>@Q 
2. QVR)e(RAQ) 5. (PA~P)v@Q 8 Pv(QA~R) 
3. ~(P>Q) 6. (PA~P)AQ 9. ~(~Pv~@Q) 


10. Suppose the statement ((P\Q)v R)>(RVS) is false. Find the truth values of 
P,Q,R and S. (This can be done without a truth table.) 

11. Suppose P is false and that the statement (R > S)@(P AQ) is true. Find the 
truth values of R and S. (This can be done without a truth table.) 
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2.6 Logical Equivalence 


In contemplating the truth table for P = Q, you probably noticed that 
P <Q is true exactly when P and Q are both true or both false. In other 
words, P = Q is true precisely when at least one of the statements P AQ 
or ~PA~Q is true. This may tempt us to say that P @Q means the same 
thing as (PAQ)v(~ PA ~Q). 

To see if this is really so, we can write truth tables for P= Q and 
(PAQ)v(~PA~Q). In doing this, it is more efficient to put these two 
statements into the same table, as follows. (This table has helper columns 
for the intermediate expressions ~ P, ~Q, (P AQ) and (~ PA ~ Q).) 


P|Q | ~P | ~Q | AQ) | (~PA~Q) | (PAQ)V(~PA~Q) | PQ 
Al Fe | (Sa eg T F T T 
T|F| F | T F F F F 
FT ||) 48") F F F F 
F\F\| T |) T F T T T 


The table shows that P= Q and (PAQ)v(~ PA ~ Q) have the same truth 
value, no matter the values P and Q. It is asif P = Q and (PAQ)v(~ Pa ~Q) 
are algebraic expressions that are equal no matter what is “plugged into” 
variables P and Q. We express this state of affairs by writing 


PoQ = (PAQ)v(~PA~Q) 


and saying that P= Q and (PAQ)v(~ PA ~Q) are logically equivalent. 
In general, two statements are logically equivalent if their truth 
values match up line-for-line in a truth table. 
Logical equivalence is important because it can give us different (and 
potentially useful) ways of looking at the same thing. As an example, the 
following table shows that P > Q is logically equivalent to (~ Q) => (~ P). 


P|Q]~P|~Q | (~Q@)>(P)|P>Q 
ye Gl en Ve T T 
val coal (Ee am (ea F F 
F\|T|| T | F T T 
| He | T T 


The fact that P > Q is logically equivalent to (~ Q) > (~ P) is especially 
important because so many theorems are statements having the form 
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P=@Q. As we will see in Chapter 5, proving such a theorem may be easier 
if we express it in the logically equivalent form (~ Q) => (~ P). 

There are two pairs of logically equivalent statements that come up 
again and again throughout this book and beyond. They are prevalent 
enough to be dignified by a special name: DeMorgan’s laws. 


Fact 2.1 (DeMorgan’s Laws) 
1. ~(PAQ) = (~P)vV(~Q) 
2 ~(PvQ) = (~P)A(~Q) 


The first of DeMorgan’s laws is verified by the following table. You are 
asked to verify the second in one of the exercises. 


P/Q ~P | ~Q |) PAQ | ~(PAQ®) | (~P)v(~Q) 
#8 (|e oa am F F 
T|F| F | T | F T r 
F\|T| T | F | F T T 
Be Poa oe T T 


DeMorgan’s laws are actually very natural and intuitive. Consider the 
statement ~(P AQ), which we can interpret as meaning that it is not the 
case that both P and Q are true. If it is not the case that both P and Q 
are true, then at least one of P or Q is false, in which case (~ P)v(~ Q) is 
true. Thus ~(P AQ) means the same thing as (~ P) v (~ Q). 

DeMorgan’s laws can be very useful. Suppose we happen to know that 
some statement having form ~ (Pv Q) is true. The second of DeMorgan’s 
laws tells us that (~ Q)A(~ P) is also true, hence ~ P and ~ Q are both true 
as well. Being able to quickly obtain such additional pieces of information 
can be extremely useful. 

Here is a summary of some significant logical equivalences. Those that 
are not immediately obvious can be verified with a truth table. 


P>@Q = Q)=>(~P) Contrapositive law (2.1) 

_ . o. Oy = a, 5 \ DeMorgan’s laws (2,2) 

Pe Z oP \ Commutative laws (2.3) 

P ¢ 6 es = : . 3) i a ea \ Distributive laws (2.4) 
POOR) = (Pvd\vh } Associative laws (2.5) 
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Notice how the distributive law P\(Q vR)=(PAQ)v(P AR) has the 
same structure as the distributive law p-(q+r)=p-q+p-r from algebra. 
Concerning the associative laws, the fact that PA(QAR)=(PAQ)AR means 
that the position of the parentheses is irrelevant, and we can write this as 
PAQAR without ambiguity. Similarly, we may drop the parentheses in 
an expression such as Pv(QvR). 

But parentheses are essential when there is a mix of A and v, as in 
Pv(QAR). Indeed, Pv(QAR) and (PvQ)AR are not logically equivalent. 
(See Exercise 13 for Section 2.6, below.) 


Exercises for Section 2.6 


A. Use truth tables to show that the following statements are logically equivalent. 


1. PAQVR)=(PAQ)V(P AR) 5. ~(PVQVR) = (~P)A(~Q)A(~R) 
2. Pv(QAR)=(PVQ)A(PVR) 6. ~(PAQAR) = (~P)v(~Q)v(~R) 
3. P=>Q=(~P)vQ 7 P>Q = (PA~Q)>@QA~Q) 
4, ~(PvVQ) = (~P)A(~Q) 8. ~P oQ=(P>~Q)A(~Q=>P) 


B. Decide whether or not the following pairs of statements are logically equivalent. 
9. PaQ and ~(~ Pv ~Q) 12. ~(P=>Q) and PA~Q 
10. (P=>Q)VR and ~(PA~Q)A~R) 138. PV(QAR) and (PVQ)AR 
11. (~P)AP=Q) and ~(Q=>P) 14. Pa(Qv ~Q) and (~ P) >(Qa ~Q) 


2.7 Quantifiers 


Using symbols a, v, ~, > and ©, we can deconstruct many English 
sentences into a symbolic form. As we have seen, this symbolic form can 
help us understand the logical structure of sentences and how different 
sentences may actually have the same meaning (as in logical equivalence). 

But these symbols alone are not powerful enough to capture the full 
meaning of every statement. To help overcome this defect, we introduce 
two new symbols that correspond to common mathematical phrases. The 
symbol “VY” stands for the phrase “For all” or “For every.” The symbol “3” 
stands for the phrase “There exists a” or “There is a.” Thus the statement 


For every n€ Z, 2n is even, 
can be expressed in either of the following ways: 


Vne Z, 2n is even, 


Vneé Z, E(2n). 
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Likewise, a statement such as 
There exists a subset X of N for which |X| =5. 
can be translated as 
AX (X CN)A(X|=5) or AXCN,|X|=5 or AXE A(N),|X|=5. 


The symbols V and i are called quantifiers because they refer in some 
sense to the quantity (i.e., all or some) of the variable that follows them. 
Symbol V is called the universal quantifier and 3 is called the existen- 
tial quantifier. Statements which contain them are called quantified 
statements. A statement beginning with V is called a universally quan- 
tified statement, and one beginning with i is called an existentially 
quantified statement. 


Example 2.5 The following English statements are paired with their 
translations into symbolic form. 


Every integer that is not odd is even. 
VYneZ,~(nis odd )>(niseven), or VWneZ,~O(n)>E(n). 


There is an integer that is not even. 
dneéZ,~ E(n). 


For every real number x, there is a real number y for which y® = x. 
VxeRJyeR y? =x. 


Given any two rational numbers a and 8, it follows that ab is rational. 
Va,beQ,abeQ. 


Given a set S (such as, but not limited to, N, Z, Q etc.), a quantified 
statement of form Vx « S,P(x) is understood to be true if P(x) is true 
for every xe S. If there is at least one x¢ S for which P(x) is false, then 
VxeS,P(x) is a false statement. Similarly, 3x € S,P(x) is true provided that 
P(x) is true for at least one element x€S; otherwise it is false. Thus each 
statement in Example 2.5 is true. Here are some examples of quantified 
statements that are false: 


Example 2.6 The following false quantified statements are paired with 
their translations. 


Every integer is even. 
VYneZ,E(n). 
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There is an integer n for which n? = 2. 
Ane Z,n? =2. 


For every real number x, there is a real number y for which y? = x. 
VxeR AyvyEeR,y? =x. 


Given any two rational numbers a and 3, it follows that Vab is rational. 


Va,beEQ,VabeQ. 


Example 2.7 When a statement contains two quantifiers you must be 
very alert to their order, for reversing the order can change the meaning. 
Consider the following statement from Example 2.5. 


VxeERJyeR y? =x. 


This statement is true, for no matter what number x is there exists a 
number y = 3/x for which y? = x. Now reverse the order of the quantifiers 
to get the new statement 


JyvyeER, VxeR,y =x. 


This new statement says that there exists a particular number y with 
the property that y? =x for every real number x. Since no number y can 
have this property, the statement is false. The two statements above have 
entirely different meanings. 


Quantified statements are often misused in casual conversation. Maybe 
you've heard someone say “All students do not pay full tuition.” when they 
mean “Not all students pay full tuition.” While the mistake is perhaps 
marginally forgivable in casual conversation, it must never be made in a 
mathematical context. Do not say “All integers are not even.” because that 
means there are no even integers. Instead, say “Not all integers are even.” 


Exercises for Section 2.7 


Write the following as English sentences. Say whether they are true or false. 


1. VxER,x?>0 6. IneN, VX € AN),|X|<n 
2. VxeERAneN,x” =0 7 VX CN Ane Z,|X|=n 

3. Jae R,VxeR,ax=x 8. VneZ, IX CN,|X|=n 
4, VXe PN),XCR 9. VneZ,imeZ,m=ant5 
5. VneN, AX € AN), |X| <n 10. me ZVneZme=n+5 
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2.8 More on Conditional Statements 


It is time to address a very important point about conditional statements 
that contain variables. To motivate this, let’s return to the following 
example concerning integers x: 


(x is a multiple of 6) => (x is even). 


As noted earlier, since every multiple of 6 is even, this is a true statement 
no matter what integer x is. We could even underscore this fact by writing 
this true statement as 


Vxe Z,(x is a multiple of 6) => (x is even). 
But now switch things around to get the different statement 
(x is even) > (x is a multiple of 6). 


This is true for some values of x such as —6, 12, 18, etc., but false for 
others (such as 2, 4, etc.). Thus we do not have a statement, but rather an 
open sentence. (Recall from Section 2.1 that an open sentence is a sentence 
whose truth value depends on the value of a certain variable or variables.) 
However, by putting a universal quantifier in front we get 


Vxe Z,(x is even) > (x is a multiple of 6), 


which is definitely false, so this new expression is a statement, not an open 
sentence. In general, given any two open sentences P(x) and Q(x) about 
integers x, the expression Vx € Z, P(x) > Q(x) is either true or false, so it is 
a statement, not an open sentence. 

Now we come to the very important point. In mathematics, whenever 
P(x) and Q(x) are open sentences concerning elements x in some set S 
(depending on context), an expression of form P(x) > Q(x) is understood 
to be the statement Vx e€ S, P(x) > Q(x). In other words, if a conditional 
statement is not explicitly quantified then there is an implied universal 
quantifier in front of it. This is done because statements of the form 
VxeS, P(x) => Q(x) are so common in mathematics that we would get tired 
of putting the VxeS in front of them. 


Thus the following sentence is a true statement (as it is true for all x). 


If x is a multiple of 6, then x is even. 
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Likewise, the next sentence is a false statement (as it is not true for all x). 
If x is even, then x is a multiple of 6. 


This leads to the following significant interpretation of a conditional 
statement, which is more general than (but consistent with) the interpre- 
tation from Section 2.3. 


Definition 2.1 If P and Q are statements or open sentences, then 
“If P, then Q,” 


is a statement. This statement is true if it’s impossible for P to be true 
while Q is false. It is false if there is at least one instance in which P is 
true but Q is false. 


Thus the following are true statements: 


If xe R, then x?+1>0. 


If a function f is differentiable on R, then f is continuous on R. 
Likewise, the following are false statements: 


If p is a prime number, then p is odd. (2 is prime.) 


If f is a rational function, then f has an asymptote. (x? is rational.) 


2.9 Translating English to Symbolic Logic 


In writing (and reading) proofs of theorems, we must always be alert to the 
logical structure and meanings of the sentences. Sometimes it is necessary 
or helpful to parse them into expressions involving logic symbols. This may 
be done mentally or on scratch paper, or occasionally even explicitly within 
the body of a proof. The purpose of this section is to give you sufficient 
practice in translating English sentences into symbolic form so that you 
can better understand their logical structure. Here are some examples: 


Example 2.8 Consider the Mean Value Theorem from Calculus: 


If f is continuous on the interval [a,b] and differentiable on (a,b), then 


there is a number c€(a,b) for which f'(c) = Mee) 


Here is a translation to symbolic form: 


((r cont. on [a,b]) A(f is diff. on (a,6))} me (ce (a,b), f= Loy fa), 
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Example 2.9 Consider Goldbach’s conjecture, from Section 2.1: 
Every even integer greater than 2 is the sum of two primes. 


This can be translated in the following ways, where P is the set of prime 
numbers and S = {4,6,8,10,...} is the set of even integers greater than 2. 


(ne S)=>(Ap,qeP,n=pt+q) 
VneS,ip,qeP,n=pt+q 


These translations of Goldbach’s conjecture illustrate an important 
point. The first has the basic structure (n € S) > Q(n) and the second has 
structure VneéeS,Q(n), yet they have exactly the same meaning. This is 
significant. Every universally quantified statement can be expressed as a 
conditional statement. 


Fact 2.2 Suppose S is a set and Q(x) is a statement about x for each 
xé€S. The following statements mean the same thing: 


VYxeS, Q(x) 
(x € S)> Q(x). 


This fact is significant because so many theorems have the form of 
a conditional statement. (The Mean Value Theorem is an example!) In 
proving a theorem we have to think carefully about what it says. Sometimes 
a theorem will be expressed as a universally quantified statement but it will 
be more convenient to think of it as a conditional statement. Understanding 
the above fact allows us to switch between the two forms. 

We close this section with some final points. In translating a state- 
ment, be attentive to its intended meaning. Don’t jump into, for example, 
automatically replacing every “and” with A and “or” with v. An example: 


At least one of the integers x and y is even. 


Don’t be led astray by the presence of the word “and.” The meaning of 
the statement is that one or both of the numbers is even, so it should be 
translated with “or,” not “and”: 


(x is even) v (y is even). 


Finally, the logical meaning of “but” can be captured by “and.” The 
sentence “The integer x is even, but the integer y is odd,” is translated as 


(x is even) A (y is odd). 
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Exercises for Section 2.9 


Translate each of the following sentences into symbolic logic. 

1. If f is a polynomial and its degree is greater than 2, then /’ is not constant. 
. The number «x is positive but the number y is not positive. 

. If x is prime then \/x is not a rational number. 

. For every prime number p there is another prime number q with q > p. 


ao ® & NW 


. For every positive number ¢, there is a positive number 6 for which |x-a| <6 
implies |f(x)— f(a)| <e. 

6. For every positive number « there is a positive number M for which |f(x)—}| <e, 

whenever x > M. 


7. There exists a real number a for which a+x =x for every real number x. 
8. I don’t eat anything that has a face. 
9. If x is a rational number and x 40, then tan(x) is not a rational number. 
10. If sin(x) <0, then it is not the case that O<x<n. 
11. There is a Providence that protects idiots, drunkards, children and the United 
States of America. (Otto von Bismarck) 
12. You can fool some of the people all of the time, and you can fool all of the people 
some of the time, but you can’t fool all of the people all of the time. (Abraham 
Lincoln) 


13. Everything is funny as long as it is happening to somebody else. (Will Rogers) 


2.10 Negating Statements 


Given a statement R, the statement ~ R is called the negation of R. If R 
is a complex statement, then it is often the case that its negation ~ R can 
be written in a simpler or more useful form. The process of finding this 
form is called negating R. In proving theorems it is often necessary to 
negate certain statements. We now investigate how to do this. 

We have already examined part of this topic. DeMorgan’s laws 


~(P AQ) 
~(PvQ) 


(~P)v(~Q) (2.6) 
(~P)A(~Q) (2.7) 


(from Section 2.6) can be viewed as rules that tell us how to negate the 
statements PAQ and PvQ. Here are some examples that illustrate how 
DeMorgan’s laws are used to negate statements involving “and” or “or.” 
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Example 2.10 Consider negating the following statement. 
R: You can solve it by factoring or with the quadratic formula. 


Now, R means (You can solve it by factoring) v (You can solve it with Q.F.), 
which we will denote as PvQ. The negation of this is 


~(PvQ) = (~P)A(~Q). 


Therefore, in words, the negation of R is 


~R: You can’t solve it by factoring and you can’t solve it with 
the quadratic formula. 


Maybe you can find ~ R without invoking DeMorgan’s laws. That is good; 
you have internalized DeMorgan’s laws and are using them unconsciously. 
Example 2.11 We will negate the following sentence. 

R: The numbers x and y are both odd. 


This statement means (x is odd) A (y is odd), so its negation is 


~ (( is odd)A(y is odd)) = ~(x is odd) v ~(y is odd) 
= (x is even) V(y is even). 
Therefore the negation of R can be expressed in the following ways: 


~R: The number x is even or the number y is even. 
~R: At least one of x and y is even. 


Now let’s move on to a slightly different kind of problem. It’s often 
necessary to find the negations of quantified statements. For example, 
consider ~ (VxeN, P(x)). Reading this in words, we have the following: 


It is not the case that P(x) is true for all natural numbers x. 


This means P(x) is false for at least one x. In symbols, this is 4x €N, ~ P(x). 
Thus ~ (Vx EN, P(x)) = 4x eEN, ~ P(x). Similarly, you can reason out that 
~ (Ax éeN, P(x)) =VxeN, ~ P(x). In general: 


~(VxeS,P(x)) = AxeS,~ P(x), (2.8) 
~ (axe S, P(x)) VxeES,~ P(x). (2.9) 
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Example 2.12 Consider negating the following statement. 
R: The square of every real number is non-negative. 


Symbolically, R can be expressed as Vx € R, x2>0, and thus its negation is 
~(Vx ER, x?2>0) = JxeER, ~ (x? =0) = 3x ER, x? <0. In words, this is 


~R: There exists a real number whose square is negative. 


Observe that R is true and ~ R is false. You may be able to get ~R 
immediately, without using Equation (2.8) as we did above. If so, that is 
good; if not, you will probably be there soon. 


If a statement has multiple quantifiers, negating it will involve several 
iterations of Equations (2.8) and (2.9). Consider the following: 


S: For every real number «x there is a real number y for which y® = x. 


This statement asserts any real number x has a cube root y, so it’s true. 
Symbolically S can be expressed as 


VxeRAyeR, y?=x. 


Let’s work out the negation of this statement. 


~(VxERAyER y=x) = AxER,~GyEeR,y® =x) 
AxER, VyER, ~(y? =x) 
AxER, VyER, y Fx. 


Therefore the negation is the following (false) statement. 


~S: There is a real number x for which y? £x for all real numbers y. 


In writing proofs you will sometimes have to negate a conditional 
statement P > Q. The remainder of this section describes how to do this. 
To begin, look at the expression ~ (P > Q), which literally says “P > Q is 
false.” You know from the truth table for > that the only way that P>Q 
can be false is if P is true and Q is false. Therefore ~(P>Q)= PA ~Q. 


~(P>Q) =PaA~Q (2.10) 


(In fact, in Exercise 12 of Section 2.6, you used a truth table to verify that 
these two statements are indeed logically equivalent.) 
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Example 2.13 Negate the following statement about a particular (i.e., 
constant) number a. 

R: If a is odd then a? is odd. 
Using Equation (2.10), we get the following negation. 

~R:a is odd and a? is not odd. 


Example 2.14 This example is like the previous one, but the constant a 
is replaced by a variable x. We will negate the following statement. 


R:If x is odd then x? is odd. 


As discussed in Section 2.8, we interpret this as the universally quantified 
statement 


R: Vx eZ, (x odd) => (x? odd). 


By Equations (2.8) and (2.10), we get the following negation for R. 


~ (vx Z, (x odd) > (x? odd)) = AxeZ,~ ((x odd)=> (x? odd)) 
Ax € Z,(x odd)a ~ (x? odd). 


Translating back into words, we have 
~R: There is an odd integer x whose square is not odd. 


Notice that R is true and ~ R is false. 

The above Example 2.14 showed how to negate a conditional statement 
P(x) > Q(x). This type of problem can sometimes be embedded in more 
complex negation. See Exercise 5 below (and its solution). 


Exercises for Section 2.10 


Negate the following sentences. 

. The number x is positive, but the number y is not positive. 

. If x is prime, then x is not a rational number. 

. For every prime number p, there is another prime number q with q > p. 


Pe OO DN = 


. For every positive number e, there is a positive number 6 such that |x-a|<6 
implies |f(x)—f(a)| <e. 
5. For every positive number e¢, there is a positive number M for which |f(x)—b| <« 
whenever x > M. 
6. There exists a real number a for which a+x=«x for every real number x. 
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7. I don’t eat anything that has a face. 

8. If x is a rational number and x 40, then tan(x) is not a rational number. 

9. If sin(x) <0, then it is not the case that O<x<z. 
10. If f is a polynomial and its degree is greater than 2, then f’ is not constant. 
11. You can fool all of the people all of the time. 


12. Whenever I have to choose between two evils, I choose the one I haven’t tried 
yet. (Mae West) 


2.11 Logical Inference 


Suppose we know that a statement of form P > Q is true. This tells us 
that whenever P is true, Q will also be true. By itself, P>Q being true 
does not tell us that either P or Q is true (they could both be false, or P 
could be false and Q true). However if in addition we happen to know 
that P is true then it must be that Q is true. This is called a logical 
inference: Given two true statements we can infer that a third statement 
is true. In this instance true statements P > Q and P are “added together” 
to get Q. This is described below with P > Q and P stacked one atop the 
other with a line separating them from Q. The intended meaning is that 
P=>Q combined with P produces Q. 


P=>Q P>Q PvQ 
P ~Q ~P 
Q ~P Q 


Two other logical inferences are listed above. In each case you should 
convince yourself (based on your knowledge of the relevant truth tables) 
that the truth of the statements above the line forces the statement below 
the line to be true. 

Following are some additional useful logical inferences. The first 
expresses the obvious fact that if P and Q are both true then the statement 
P AQ will be true. On the other hand, P~Q being true forces P (also Q) 
to be true. Finally, if P is true, then PvQ must be true, no matter what 
statement Q is. 


5 PAQ P 


PrQ P Pv@ 


These inferences are so intuitively obvious that they scarcely need to 
be mentioned. However, they represent certain patterns of reasoning that 
we will frequently apply to sentences in proofs, so we should be cognizant 
of the fact that we are using them. 
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2.12 An Important Note 


It is important to be aware of the reasons that we study logic. There 
are three very significant reasons. First, the truth tables we studied tell 
us the exact meanings of the words such as “and,” “or”, “not” and so on. 
For instance, whenever we use or read the “If..., then” construction in 
a mathematical context, logic tells us exactly what is meant. Second, 
the rules of inference provide a system in which we can produce new 
information (statements) from known information. Finally, logical rules 
such as DeMorgan’s laws help us correctly change certain statements into 
(potentially more useful) statements with the same meaning. Thus logic 
helps us understand the meanings of statements and it also produces new 
meaningful statements. 

Logic is the glue that holds strings of statements together and pins down 
the exact meaning of certain key phrases such as the “/f..., then” or “For 
all” constructions. Logic is the common language that all mathematicians 
use, so we must have a firm grip on it in order to write and understand 
mathematics. 

But despite its fundamental role, logic’s place is in the background of 
what we do, not the forefront. From here on, the beautiful symbols A, v, 
>, <, ~, V and J are rarely written. But we are aware of their meanings 
constantly. When reading or writing a sentence involving mathematics we 
parse it with these symbols, either mentally or on scratch paper, so as to 
understand the true and unambiguous meaning. 


CHAPTER 3 


Counting 


t may seem peculiar that a college-level text has a chapter on counting. 
At its most basic level, counting is a process of pointing to each object 
in a collection and calling off “one, two, three,...” until the quantity of 
objects is determined. How complex could that be? Actually, counting 
can become quite subtle, and in this chapter we explore some of its more 
sophisticated aspects. Our goal is still to answer the question “How many?” 
but we introduce mathematical techniques that bypass the actual process 
of counting individual objects. 
Almost every branch of mathematics uses some form of this “sophisti- 
cated counting.” Many such counting problems can be modeled with the 
idea of a list, so we start there. 


3.1 Counting Lists 


A list is an ordered sequence of objects. A list is denoted by an opening 
parenthesis, followed by the objects, separated by commas, followed by a 
closing parenthesis. For example (a,b,c,d,e) is a list consisting of the first 
five letters of the English alphabet, in order. The objects a,b,c,d,e are 
called the entries of the list; the first entry is a, the second is b, and so 
on. If the entries are rearranged we get a different list, so, for instance, 


(a,b,c,d,e) #4 (b,a,c,d,e). 


A list is somewhat like a set, but instead of being a mere collection of 
objects, the entries of a list have a definite order. Note that for sets we 
have 

{a,b,c,d,e} = {b,a,c,d,e}, 


but—as noted above—the analogous equality for lists does not hold. 

Unlike sets, lists are allowed to have repeated entries. For example 
(5,3,5,4,3,3) is a perfectly acceptable list, as is (S,O,S). The number of 
entries in a list is called its length. Thus (5,3,5,4,3,3) has length six, and 
(S,O,S) has length three. 
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Occasionally we may get sloppy and write lists without parentheses 
and commas; for instance, we may express (S,O,S) as SOS if there is no 
danger of confusion. But be alert that doing this can lead to ambiguity. 
Is it reasonable that (9,10,11) should be the same as 91011? If so, then 
(9,10,11) = 91011 =(9,1,0,1,1), which makes no sense. We will thus almost 
always adhere to the parenthesis/comma notation for lists. 

Lists are important because many real-world phenomena can be de- 
scribed and understood in terms of them. For example, your phone number 
(with area code) can be identified as a list of ten digits. Order is essential, 
for rearranging the digits can produce a different phone number. A byte is 
another important example of a list. A byte is simply a length-eight list of 
0’s and 1’s. The world of information technology revolves around bytes. 

To continue our examples of lists, (a, 15) is a list of length two. Likewise 
(0,(0,1,1)) is a list of length two whose second entry is a list of length three. 
The list (N,Z,R) has length three, and each of its entries is a set. We 
emphasize that for two lists to be equal, they must have exactly the same 
entries in exactly the same order. Consequently if two lists are equal, then 
they must have the same length. Said differently, if two lists have different 
lengths, then they are not equal. For example, (0,0,0,0,0,0) 4 (0,0,0,0,0). 
For another example note that 


prea 

: mi 

(g,r,0,c,e,r,y, l,i,s,t) Z ( eggs } 
mustard 
coffee 


because the list on the left has length eleven but the list on the right has 
just one entry (a piece of paper with some words on it). 

There is one very special list which has no entries at all. It is called 
the empty list, and is denoted (). It is the only list whose length is zero. 

One often needs to count up the number of possible lists that satisfy 
some condition or property. For example, suppose we need to make a list of 
length three having the property that the first entry must be an element 
of the set {a,b,c}, the second entry must be in {5,7} and the third entry 
must be in {a,x}. Thus (a,5,a) and (b,5,a) are two such lists. How many 
such lists are there all together? To answer this question, imagine making 
the list by selecting the first element, then the second and finally the third. 
This is described in Figure 3.1. The choices for the first list entry are a,b 
or c, and the left of the diagram branches out in three directions, one for 
each choice. Once this choice is made there are two choices (5 or 7) for 
the second entry, and this is described graphically by two branches from 
each of the three choices for the first entry. This pattern continues for the 


Counting Lists 63 


choice for the third entry which is either a or x. Thus, in the diagram there 
are 3-2-2=12 paths from left to right, each corresponding to a particular 
choice for each entry in the list. The corresponding lists are tallied at the 
far-right end of each path. So, to answer our original question, there are 
12 possible lists with the stated properties. 


Resulting list 
first choice second choice third choice i 
L 


(a,5,a) 
(a,5,x) 
(a,7,a) 
(a,7,x) 
(b,5,a) 
(6,5, x) 
(b,7,a) 
(b,7,x) 
(c,5,a) 
(c,5,x) 
(c,7,a) 
(c,7,x) 


Figure 3.1. Constructing lists of length 3 


We summarize the type of reasoning used above in the following im- 
portant fact, which is called the multiplication principle. 


Fact 3.1 (Multiplication Principle) Suppose in making a list of length 
n there are a, possible choices for the first entry, a2 possible choices for 
the second entry, a3 possible choices for the third entry and so on. Then 
the total number of different lists that can be made this way is the product 
dicnsed yas An: 


So, for instance, in the above example we had a, = 3,a2=2 and a3 =2, 
so the total number of lists was aj-a2:a3 =3-2:2=12. Now let’s look at 
some additional examples of how the multiplication principle can be used. 


Example 3.1 A standard license plate consists of three letters followed 
by four numbers. For example, JRB-4412 and MMX-8901 are two standard 
license plates. (Vanity plates such as LV2COUNT are not included among 
the standard plates.) How many different standard license plates are 
possible? 
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To answer this question, note that any standard license plate such as 
JRB-4412 corresponds to a length-7 list (J,R,B,4,4,1,2), so the question 
can be answered by counting how many such lists are possible. We use the 
multiplication principle. There are a; = 26 possibilities (one for each letter 
of the alphabet) for the first entry of the list. Similarly, there are a2 = 26 
possibilities for the second entry and a3 = 26 possibilities for the third 
entry. There are a4=10 possibilities for the fourth entry, and likewise 
a5 =ag=a7=10. Therefore there are a total of a,-ag:a3-a4:a5-ag:a7= 
26-26-26-10-10-10-10= 175,760,000 possible standard license plates. 


There are two types of list-counting problems. On one hand, there are 
situations in which the same symbol or symbols may appear multiple times 
in different entries of the list. For example, license plates or telephone 
numbers can have repeated symbols. The sequence CCX-4144 is a perfectly 
valid license plate in which the symbols C and 4 appear more than once. 
On the other hand, for some lists repeated symbols do not make sense or 
are not allowed. For instance, imagine drawing 5 cards from a standard 
52-card deck and laying them in a row. Since no 2 cards in the deck 
are identical, this list has no repeated entries. We say that repetition is 
allowed in the first type of list and repetition is not allowed in the second 
kind of list. (Often we call a list in which repetition is not allowed a 
non-repetitive list.) The following example illustrates the difference. 


Example 3.2 Consider making lists from symbols A, B, C, D, E, F, G. 
(a) How many length-4 lists are possible if repetition is allowed? 
(b) How many length-4 lists are possible if repetition is not allowed? 


(c) How many length-4 lists are possible if repetition is not allowed and 
the list must contain an E? 


(d) How many length-4 lists are possible if repetition is allowed and the 
list must contain an E? 


Solutions: 


(a) Imagine the list as containing four boxes that we fill with selections 
from the letters A,B,C,D,E,F and G, as illustrated below. 


) 


x? ) ) 
7 choices | 
7 choices 


7 choices 
7 choices 


There are seven possibilities for the contents of each box, so the total 
number of lists that can be made this way is 7-7-7-7= 2401. 
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(b) This problem is the same as the previous one except that repetition is 
not allowed. We have seven choices for the first box, but once it is filled 
we can no longer use the symbol that was placed in it. Hence there are 
only six possibilities for the second box. Once the second box has been 
filled we have used up two of our letters, and there are only five left to 
choose from in filling the third box. Finally, when the third box is filled 
we have only four possible letters for the last box. 


T ) ’ ) 
7 choices | 


6 choices 
5 choices 
4 choices 
Thus the answer to our question is that there are 7-6-5-4= 840 lists in 
which repetition does not occur. 


(c) We are asked to count the length-4 lists in which repetition is not 
allowed and the symbol EF must appear somewhere in the list. Thus E 
occurs once and only once in each such list. Let us divide these lists into 
four categories depending on whether the EF occurs as the first, second, 
third or fourth entry. These four types of lists are illustrated below. 


Type 1 Type 2 Type 3 Type 4 
’ r ’ ’ ’ ) ) ) ’ ’ ) ) ’ 
6 ie 6 choices 6 ies 6 ne 
5 choices 5 choices 5 choices 5 choices 
4 choices 4 choices 4 choices 4 choices 


Consider lists of the first type, in which the E appears in the first entry. 
We have six remaining choices (A,B,C,D,F or G) for the second entry, five 
choices for the third entry and four choices for the fourth entry. Hence 
there are 6-5-4=120 lists having an E in the first entry. As indicated 
in the above diagram, there are also 6-5-4 = 120 lists having an E in the 
second, third or fourth entry. Thus there are 120+ 120+ 120+ 120 = 480 
such lists all together. 


(d) Now we must find the number of length-four lists where repetition 
is allowed and the list must contain an E. Our strategy is as follows. 
By Part (a) of this exercise there are 7-7:7-7= 74 = 2401 lists where 
repetition is allowed. Obviously this is not the answer to our current 
question, for many of these lists contain no E. We will subtract from 
2401 the number of lists that do not contain an E. In making a list that 
does not contain an E, we have six choices for each list entry (because 
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we can choose any one of the six letters A,B,C,D,F or G). Thus there 
are 6-6-6-6= 64 = 1296 lists that do not have an E. Therefore the final 
answer to our question is that there are 2401-1296 = 1105 lists with 
repetition allowed that contain at least one E. 


Perhaps you wondered if Part (d) of Example 3.2 could be solved with 
a setup similar to that of Part (c). Let’s try doing it that way. We want 
to count the length-4 lists (with repetition allowed) that contain at least 
one E. The following diagram is adapted from Part (c), the only difference 
being that there are now seven choices in each slot because we are allowed 
to repeat any of the seven letters. 


Type 1 Type 2 Type 3 Type 4 
) ’ ’ ’ ) ’ ’ ) ’ ’ ’ ’ 
7 choices 7 choices 7 shina 7 chides 
7 choices 7 choices 7 choices 7 choices 
7 choices 7 choices 7 choices 7 choices 


This gives a total of 77+7°+7°+7° = 1372 lists, an answer that is 
substantially larger than the (correct) value of 1105 that we got in our 
solution to Part (d) above. It is not hard to see what went wrong. The 
list (E,E,A,B) is of type 1 and type 2, so it got counted twice. Similarly 
(E,E,C,E) is of type 1, 3 and 4, so it got counted three times. In fact, you 
can find many similar lists that were counted multiple times. 

In solving counting problems, we must always be careful to avoid this 
kind of double-counting or triple-counting, or worse. 


Exercises for Section 3.1 


Note: A calculator may be helpful for some of the exercises in this chapter. This 
is the only chapter for which a calculator may be helpful. (As for the exercises in 
the other chapters, a calculator makes them harder.) 
1. Consider lists made from the letters T,H,E,O,R,Y, with repetition allowed. 
(a) How many length-4 lists are there? 
(b) How many length-4 lists are there that begin with T? 
(c) How many length-4 lists are there that do not begin with T? 
2. Airports are identified with 3-letter codes. For example, the Richmond, Virginia 


airport has the code RIC, and Portland, Oregon has PDX. How many different 
3-letter codes are possible? 


3. How many lists of length 3 can be made from the symbols A,B,C,D,E,F if... 
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10. 


11. 


12. 


(a) ... repetition is allowed. 

(b) ... repetition is not allowed. 

(ec) ... repetition is not allowed and the list must contain the letter A. 
(d) ... repetition is allowed and the list must contain the letter A. 


. Five cards are dealt off of a standard 52-card deck and lined up in a row. How 


many such line-ups are there in which all 5 cards are of the same suit? 


. Five cards are dealt off of a standard 52-card deck and lined up in a row. How 


many such line-ups are there in which all 5 cards are of the same color (i.e., 
all black or all red)? 


. Five cards are dealt off of a standard 52-card deck and lined up in a row. How 


many such line-ups are there in which exactly one of the 5 cards is a queen? 


. This problems involves 8-digit binary strings such as 10011011 or 00001010 


(i.e., 8-digit numbers composed of 0’s and 1’s). 

(a) How many such strings are there? 

(b) How many such strings end in 0? 

(c) How many such strings have the property that their second and fourth 
digits are 1’s? 

(d) How many such strings have the property that their second or fourth digits 
are 1’s? 


. This problem concerns lists made from the symbols A,B,C,D,E. 


(a) How many such length-5 lists have at least one letter repeated? 
(b) How many such length-6 lists have at least one letter repeated? 


. This problem concerns 4-letter codes made from the letters A,B,C,D,...,Z. 


(a) How many such codes can be made? 

(b) How many such codes have no two consecutive letters the same? 

This problem concerns lists made from the letters A,B,C,D,E,F,G,H,LJ. 

(a) How many length-5 lists can be made from these letters if repetition is not 
allowed and the list must begin with a vowel? 

(b) How many length-5 lists can be made from these letters if repetition is not 
allowed and the list must begin and end with a vowel? 

(c) How many length-5 lists can be made from these letters if repetition is not 
allowed and the list must contain exactly one A? 

This problem concerns lists of length 6 made from the letters A,B,C,D,E,F,G,H. 

How many such lists are possible if repetition is not allowed and the list 

contains two consecutive vowels? 

Consider the lists of length six made with the symbols P, R, O, F, S, where 

repetition is allowed. (For example, the following is such a list: (P,R,O,O,F,S).) 

How many such lists can be made if the list must end in an S and the symbol 

O is used more than once? 
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3.2 Factorials 


In working the examples from Section 3.1, you may have noticed that often 
we need to count the number of non-repetitive lists of length n that are 
made from n symbols. In fact, this particular problem occurs with such 
frequency that a special idea, called a factorial, is introduced to handle it. 


n | Symbols Non-repetitive lists of length n made from the symbols n!} 

O | 4} 0) 1 

1 | {A} (A) 1 

2 | {A,B} (A,B),(B, A) 2 

3 | {A,B,C} (A,B,C),(A,C,B),(B,C,A),(B,A,C),(C,A,B),(C,B,A) 6 
(A,B,C,D),(A,B,D,C),(A,C,B,D),(A,C,D,B),(A,D,B,C),(A,D,C,B) 

4 {A B.C D} (B,A,C,D),(B,A,D,C),(B,C,A,D),(B,C,D,A),(B,D,A,C),(B,D,C,A) 24 
a ae: (C,A,B,D),(C,A,D,B),(C,B,A,D),(C,B,D,A),(C,D,A,B),(C,D,B,A) 
(D,A,B,C),(D,A,C.,B),(D,B,A,C),(D,B,C,A),(D,C,A,B),(D,C,B,A) 


The above table motivates this idea. The first column contains succes- 
sive integer values n (beginning with 0) and the second column contains 
a set {A,B,---} of n symbols. The third column contains all the possible 
non-repetitive lists of length n which can be made from these symbols. 
Finally, the last column tallies up how many lists there are of that type. 
Notice that when n= 0 there is only one list of length 0 that can be made 
from 0 symbols, namely the empty list (). Thus the value 1 is entered in 
the last column of that row. 

For n >0, the number that appears in the last column can be computed 
using the multiplication principle. The number of non-repetitive lists of 
length n that can be made from n symbols is n(n—-1)(n—2)---3-2-1. Thus, for 
instance, the number in the last column of the row for n =4 is 4-3-2-1= 24. 


The number that appears in the last column of Row n is called the 
factorial of n. It is denoted as n! (read “n factorial”). Here is the definition: 


Definition 3.1 If n is a non-negative integer, then the factorial of n, 
denoted n!, is the number of non-repetitive lists of length n that can 
be made from n symbols. Thus 0!=1 and 1!=1. If n>1, then n! = 
n(n—1)(n—-2)---8-2-1. 
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It follows that 0! = 
1h = 
2! = 
3! = 
4l = 
5! = 
6! = 6-5-4-3-2-1=720, and so on. 


oP WN BH 


Students are often tempted to say 0!=0, but this is wrong. The correct 
value is 0!=1, as the above definition and table tell us. Here is another 
way to see that 0! must equal 1: Notice that 5!= 5-4-3-2-1= 5-(4-3-2-1)= 
5-4!. Also 4!= 4-3-2-1= 4-(3-2-1)= 4-3!. Generalizing this reasoning, we 
have the following formula. 


ni=n-(n-1)! (3.1) 


Plugging in n =1 gives 1!=1-(1-1)!=1-0!. If we mistakenly thought 0! 
were 0, this would give the incorrect result 1! =0. 
We round out our discussion of factorials with an example. 


Example 3.3 This problem involves making lists of length seven from 
the symbols 0,1,2,3,4,5 and 6. 


(a) How many such lists are there if repetition is not allowed? 


(b) How many such lists are there if repetition is not allowed and the 
first three entries must be odd? 


(c) How many such lists are there in which repetition is allowed, and 
the list must contain at least one repeated number? 


To answer the first question, note that there are seven symbols, so the 
number of lists is 7!=5040. To answer the second question, notice that 
the set {0,1,2,3,4,5,6} contains three odd numbers and four even numbers. 
Thus in making the list the first three entries must be filled by odd numbers 
and the final four must be filled with even numbers. By the multiplication 
principle, the number of such lists is 3-2-1-4-3-2-1= 3!4!=144. 

To answer the third question, notice that there are 7’ = 823,543 lists 
in which repetition is allowed. The set of all such lists includes lists 
that are non-repetitive (e.g., (0,6,1,2,4,3,5)) as well as lists that have 
some repetition (e.g., (6,3,6,2,0,0,0)). We want to compute the number of 
lists that have at least one repeated number. To find the answer we can 
subtract the number of non-repetitive lists of length seven from the total 
number of possible lists of length seven. Therefore the answer is 7’ — 7! = 
823,543 — 5040 = 818, 503. 
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We close this section with a formula that combines the ideas of the first 
and second sections of the present chapter. One of the main problems of 
Section 3.1 was as follows: Given n symbols, how many non-repetitive lists 
of length k can be made from the n symbols? We learned how to apply the 
multiplication principle to obtain the answer 


n(n—1)\(n-2)-:-(n-k+1). 


Notice that by cancellation this value can also be written as 


n(n—-1)\(n-2)---(n-—k+1)\(n-k)\(n-k-1)-::3-2-1 n! 


(n—k)\(n—k-1)---3-2:1 (n—k)! 
We summarize this as follows: 


Fact 3.2 The number of non-repetitive lists of length k whose entries 


5 s . ! 
are chosen from a set of n possible entries is mE 


For example, consider finding the number of non-repetitive lists of 
length five that can be made from the symbols 1,2,3,4,5,6,7,8. We will do 
this two ways. By the multiplication principle, the answer is 8-7-6-5-4= 
6720. Using the formula from Fact 3.2, the answer is oar ee anae6 = 
6720. 

The new formula isn’t really necessary, but it is a nice repackaging of 


an old idea and will prove convenient in the next section. 


Exercises for Section 3.2 


1. What is the smallest n for which n! has more than 10 digits? 
2. For which values of n does n! have n or fewer digits? 


3. How many 5-digit positive integers are there in which there are no repeated 


digits and all digits are odd? 


; ; 100! 
4. Using only pencil and paper, find the value of 9.7; 


5. Using only pencil and paper, find the value of iO 


6. There are two 0’s at the end of 10! = 3,628,800. Using only pencil and paper, 
determine how many 0’s are at the end of the number 100!. 

7. Compute how many 9-digit numbers can be made from the digits 1,2,3,4,5,6,7,8,9 
if repetition is not allowed and all the odd digits occur first (on the left) followed 
by all the even digits (i.e. as in 1375980264, but not 0123456789). 

8. Compute how many 7-digit numbers can be made from the digits 1,2,3,4,5,6,7 if 
there is no repetition and the odd digits must appear in an unbroken sequence. 
(Examples: 3571264 or 2413576 or 2467531, etc., but not 7234615.) 
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9. There is a very interesting function I :[0,oo) — R called the gamma function. 

It is defined as I(x) = [>°t*1e~‘dt. It has the remarkable property that if x<N, 
then I(x) =(x-1)!. Check that this is true for x = 1,2,3,4. 
Notice that this function provides a way of extending factorials to numbers other 
than integers. Since ['(n) =(n—-1)! for all ne N, we have the formula n! = I'(n+1). 
But I can be evaluated at any number in [0,oo), not just at integers, so we 
have a formula for n! for any n€[0,00). Extra credit: Compute z!. 


10. There is another significant function called Stirling’s formula that provides an 
approximation to factorials. It states that n! = V2mn(#)”. It is an approximation 
to n! in the sense that aay approaches 1 as n approaches oo. Use Stirling’s 
formula to find approximations to 5!, 10!, 20! and 50!. 


3.3 Counting Subsets 


The previous two sections were concerned with counting the number of 
lists that can be made by selecting k entries from a set of n possible entries. 
We turn now to a related question: How many subsets can be made by 
selecting k elements from a set with n elements? 

To highlight the differences between these two problems, look at the set 
A = ja,b,c,d,e}. First, think of the non-repetitive lists that can be made 
from selecting two entries from A. By Fact 3.2 (on the previous page), 


there are ay = Ey = 120 = 20 such lists. They are as follows. 


(a,b), (a,c), (a,d), (a,e), (b,c), (b,d), (b,e), (c,d), (c,e) (d,e) 
(b,a), (c,a), (d,a), (e,a), (c,b), (d,b), (e,b), (d,c), (e,c) (e,d) 


Next consider the subsets of A that can made from selecting two ele- 
ments from A. There are only ten such subsets, as follows. 


{a,b}, {a,c}, {a,d}, {a,e}, {b,c}, {b,d}, {b,e}, {c,d}, {c,e}, {d,e}. 


The reason that there are more lists than subsets is that changing the 
order of the entries of a list produces a different list, but changing the 
order of the elements of a set does not change the set. Using elements 
a,be A, we can make two lists (a,b) and (b,a), but only one subset {a,b}. 

In this section we are concerned not with counting lists, but with 
counting subsets. As was noted above, the basic question is this: How 
many subsets can be made by choosing k elements from an n-element 
set? We begin with some notation that gives a name to the answer to this 
question. 
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Definition 3.2 If n and k are integers, then (7) denotes the number 
of subsets that can be made by choosing k elements from a set with n 
elements. The symbol (j) is read “n choose k.” (Some textbooks write 
C(n,k) instead of (7).) 


To illustrate this definition, the following table computes the values of 
(7) for various values of k by actually listing all the subsets of the 4-element 
set A = {a,b,c,d} that have cardinality k. The values of k appear in the 
far-left column. To the right of each & are all of the subsets (if any) of A of 
size k. For example, when k = 1, set A has four subsets of size k, namely 
{a}, {ob}, {c} and {d}. Therefore ({)=4. Similarly, when k = 2 there are six 
subsets of size k so (5) =6. 


k | k-element subsets of {a,b,c,d} ) 

=a) (4) =0 
0 |¢ ()=1 
1 | {a},4},te}.1¢} ()=4 
2 | {a,b}, {a,ch.{a,d}, {b,c}, {b,d},{c,d} | (3)=6 
3 | {a,b,c}, {a,b,d}, {a,c,d}, 1b, c,d} (3)=4 
4 | {a,b,c,d} (J=1 
5 (3) =0 
6 (5) =0 


When k = 0, there is only one subset of A that has cardinality k, namely 
the empty set, d. Therefore (5) = 1. 

Notice that if k is negative or greater than |A|, then A has no subsets 
of cardinality k, so (7) =0 in these cases. In general (') = 0 whenever k <0 
or k>n. In particular this means (7) =0 if n is negative. 


Although it was not hard to work out the values of (;) by writing out 
subsets in the above table, this method of actually listing sets would not 
be practical for computing (7) when n and k are large. We need a formula. 
To find one, we will now carefully work out the value of (3) in such a way 
that a pattern will emerge that points the way to a formula for any ae 
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To begin, note that (3) is the number of 3-element subsets of {a,b,c,d,e}. 
These are listed in the following table. We see that in fact (3) = 10. 


<—_—_—_——— (3) 


{a,b,c} {a,b,d} {a,b,e} {a,c,d} {a,c,e} {a,d,e} {b,c,d} {b,c,e} {b,d,e} {c,d,e} 


The formula will emerge when we expand this table as follows. Taking 
any one of the ten 3-element sets above, we can make 3! different non- 
repetitive lists from its elements. For example, consider the first set {a,b,c}. 
The first column of the following table tallies the 3!=6 different lists that 
can be the letters {a,b,c}. The second column tallies the lists that can be 
made from {a,b,d}, and so on. 


3! 


The final table has (3) columns and 3! rows, so it has a total of 3!(3) lists. 
But notice also that the table consists of every non-repetitive length-3 list 
that can be made from the symbols {a,b,c,d,e}. We know from Fact 3.2 
that there are sar such lists. Thus the total number of lists in the table 
is 31(3) = oa Dividing both sides of this equation by 3!, we get 


5) so! 
3] 35-3)!" 


Working this out, you will find that it does give the correct value of 10. 


But there was nothing special about the values 5 and 3. We could 
do the above analysis for any (') instead of (3). The table would have (7) 
columns and k! rows. We would get 


n\ | n! 
k] kin-k)! 


We summarize this as follows: 
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! 
Fact 3.3 Ifn,keZ and 0<k<n, then " 2 EES Otherwise 4 =) 


Let’s now use our new knowledge to work some exercises. 


Example 3.4 How many 4-element subsets does {1,2,3,4,5,6,7,8,9} have? 


. (9) 9! =! _s«29-8-7-6-5! _ 9-876 _ 9-8-7-6 _ 
The answer is (4) = W@-bl = a= ar = a og 126. 


Example 3.5 A single 5-card hand is dealt off of a standard 52-card deck. 
How many different 5-card hands are possible? 

To answer this, think of the deck as being a set D of 52 cards. Then a 
5-card hand is just a 5-element subset of D. For example, here is one of 
many different 5-card hands that might be dealt from the deck. 


7|| 2) ) 3} ] Al] 5 
& ’ & ? 9 ’ a ? AN 

The total number of possible hands equals the number of 5-element 
subsets of D, that is 


2 2! 52-51-50-49-48-47! 52-51-50-49-4 
5 52!__ 52-51-50-49-48-47! _ 52-51:50-49-48 9 


5] 5-47! 5! 47! 5! 


Thus the answer to our question is that there are 2,598,960 different 
five-card hands that can be dealt from a deck of 52 cards. 


Example 3.6 This problem concerns 5-card hands that can be dealt off 
of a 52-card deck. How many such hands are there in which two of the 
cards are clubs and three are hearts? 

Solution: Think of such a hand as being described by a list of length 


two of the form 
lef le lf ilotle tls): 


where the first entry is a 2-element subset of the set of 18 club cards, and 
the second entry is a 3-element subset of the set of 13 heart cards. There 


are (';’) choices for the first entry and ("}) choices for the second entry, so 


by the multiplication principle there are ('3)("}) = 124) stfu) = 22,308 such 


lists. Answer: There are 22,308 possible 5-card hands with two clubs 
and three hearts. 


Example 3.7 Imagine a lottery that works as follows. A bucket contains 
36 balls numbered 1,2,3,4,...,36. Six of these balls will be drawn randomly. 
For $1 you buy a ticket that has six blanks: OOOO). You fill in the 
blanks with six different numbers between 1 and 36. You win $1,000,000 
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if you chose the same numbers that are drawn, regardless of order. What 
are your chances of winning? 


Solution: In filling out the ticket you are choosing six numbers from 


a set of 36 numbers. Thus there are (2) = een = 1,947,792 different 
combinations of numbers you might write. Only one of these will be a 
winner. Your chances of winning are one in 1,947,792. 


Exercises for Section 3.3 


13. 


14. 


. Suppose a set A has 37 elements. How many subsets of A have 10 elements? 


How many subsets have 30 elements? How many have 0 elements? 


. Suppose A is a set for which |A| = 100. How many subsets of A have 5 elements? 


How many subsets have 10 elements? How many have 99 elements? 


. Aset X has exactly 56 subsets with 3 elements. What is the cardinality of X? 
. Suppose a set B has the property that |{X : X « A(B),|X| =6}| = 28. Find |BI. 
. How many 16-digit binary strings contain exactly seven 1’s? (Examples of such 


strings include 0111000011110000 and 0011001100110010, etc.) 


~ [{X € A{0,1,2,3,4,5,6,7,8, 9}) : |X| = 4}| = 
. |{X © P({0,1,2,3,4,5,6,7,8,9}) : |X] <4}| = 
. This problem concerns lists made from the symbols A,B,C,D,E,F,G,H,I. 


(a) How many length-5 lists can be made if repetition is not allowed and the 
list is in alphabetical order? (Example: BDEFT or ABCGH, but not BACGH.) 

(b) How many length-5 lists can be made if repetition is not allowed and the 
list is not in alphabetical order? 


. This problem concerns lists of length 6 made from the letters A,B,C,D,E,F, 


without repetition. How many such lists have the property that the D occurs 
before the A? 


. Adepartment consists of 5 men and 7 women. From this department you select 


a committee with 3 men and 2 women. In how many ways can you do this? 


. How many positive 10-digit integers contain no 0’s and exactly three 6’s? 
. Twenty-one people are to be divided into two teams, the Red Team and the 


Blue Team. There will be 10 people on Red Team and 11 people on Blue Team. 
In how many ways can this be done? 

Suppose n and & are integers for which 0<k <n. Use the formula (7) = Ro 
to show that (7) = (,,”,). 

Suppose n,ke Z, and 0<k<n. Use Definition 3.2 alone (without using Fact 3.2) 
to show that (7) =(,”,). 
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3.4 Pascal’s Triangle and the Binomial Theorem 


There are some beautiful and significant patterns among the numbers G) 


This section investigates a pattern based on one equation in particular. It 


happens that 
n+1 n n 
vaneond - 


for any integers n and k with 0<k<n. 

To see why this is true, recall that (”;') equals the number of k-element 
subsets of a set with n+1 elements. Now, the set A = {0,1,2,3,...,n} has 
n+1 elements, so C4 equals the number of k-element subsets of A. Such 
subsets can be divided into two types: those that contain 0 and those that 
do not contain 0. To make a k-element subset that contains 0 we can start 
with {0} and then append to this set an additional k - 1 numbers selected 
from {1,2,3,...,n}. There are (,”,) ways to make this selection, so there 
are (,”,) k-element subsets of A that contain 0. Concerning the k-element 
subsets of A that do not contain 0, there are (;) of these sets, for we can 
form them by selecting k elements from the n-element set {1,2,3,...,n$. In 
light of all this, Equation (3.2) just expresses the obvious fact that the 
number of k-element subsets of A equals the number of k-element subsets 


that contain 0 plus the number of k-element subsets that do not contain 0. 


(0) , 1 
5 (0) ‘ (1) P 1 1 
40) a) (2) , 1 2 1 
0 (2) 0), 1 3 3 1 
0) () G) (3) GQ). 1 4 6 4 1 
: (0) Z (7) . (3) - (3) f (4) 6 (:) ; 1 5 1010 5 1 
to) (i) () (3) G(s), 1 6 15 2015 6 1 
(0) G) @ @) @ & @ (7). 1lo7 21 35 35 21 7 1 


Figure 3.2. Pascal’s triangle 


Now that we have seen why Equation (3.2) is true, we are going to 
arrange the numbers (ih) in a triangular pattern that highlights various 
relationships among them. The left-hand side of Figure 3.2 shows numbers 
() arranged in a pyramid with (3) at the apex, just above a row containing 
(,) with k=0 and k=1. Below this is a row listing the values of (7) for 
k =0,1,2. In general, each row listing the numbers (/}) is just above a row 


listing the numbers ("7'). 
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Any number Ce in this pyramid falls immediately below and between 


the positions of the two numbers (,”,) and ({) in the previous row. But 
Equation 3.2 says ("7") = (,”,)+(), and therefore any number (other than 1) 
in the pyramid is the sum of the two numbers immediately above it. 

This pattern is especially evident on the right of Figure 3.2, where the 
values of each (7) have been worked out. Notice how 21 is the sum of the 
two numbers 6 and 15 above it. Similarly, 5 is the sum of the numbers 1 
and 4 above it and so on. 

The arrangement of numbers on the right of Figure 3.2 is called 
Pascal’s triangle. (It is named after Blaise Pascal, 1623-1662, a French 
mathematician and philosopher who discovered many of its properties.) 
Although we have written only the first eight rows (beginning with Row 0 
at the apex) of Pascal’s triangle, it obviously could be extended downward 
indefinitely. We could add an additional row at the bottom by placing a 
1 at each end and obtaining each remaining number by adding the two 
numbers above its position. Doing this would give the following row: 


1 8 28 56 70 56 28 8 1 


This row consists of the numbers (?) for 0<k <8, and we have computed 
them without using the formula (;) = g@gy. Any number (7) can be 
computed in this fashion. 

We call the very top row (containing only 1) Row 0. Row 1 is the next 
row down, followed by Row 2, then Row 8, etc. With this labeling, Row n 


consists of the numbers (7) for O<k <n. 


1 1 
1 1 Ix + ly 
1 2 1 1x2 + 2xy + Ly? 
1 3 3 1 1x3 + 8x2y + Bxy2 + 1y3 
1 4 6 4 1 1x4 + 4x3 y + 6x2y2+ 4xy? + 1y4 


1 5 10 10 5 1 1x5 + Baty +10x3 y2+10x2y3+ 5ay4 + 1y5 


Figure 3.3. The n”” row of Pascal’s triangle lists the coefficients of (x +)” 


Notice that the numbers in Row n of Pascal’s triangle appear to be the 
same as the coefficients of the expansion of (x+ y)”. For example, for n = 2 
we have (x+y)? = 1x2+2xy+1y?, and Row 2 of Pascal’s triangle lists the 
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coefficients 1 2 1. Similarly (x+y)? = 1x°+3x?y+3xy?+1y%, and Row 8 lists 
the coefficients 1 3 3 1. Pascal’s triangle is shown on the left of Figure 3.3, 
and immediately to the right of each Row n is listed the expansion of 
(x+y)”. In every case (at least as far as you care to check) the numbers in 
Row n match up with the coefficients of (x ++)”. 

In fact this turns out to be true for every n. This result is known as 
the binomial theorem, and it is worth mentioning here. It tells how to 
raise a binomial x+y to a non-negative integer power n. 


Theorem 3.1 (Binomial Theorem) If 7n is a non-negative integer, then 
(xt y)" = (Gx? + (Da? ly + (G)x™ ey? + (Bla ByF +--+ (7 Jay? t+ (Py”. 

For now we will be content to accept the binomial theorem without 
proof. (You will be asked to prove it in an exercise in Chapter 10.) But 
until then you may find it useful from time to time. For instance, you 
can apply it if you ever need to expand an expression such as (x +)’. To 
do this, look at Row 7 of Pascal’s triangle in Figure 3.2 and apply the 
binomial Theorem to get 


(x + y)! =x! 4 Tx°y + 21x y? + 35x*y? + 35x° y4 + 21x? y° + Txy® ay, 


Exercises for Section 3.4 


1. Write out Row 11 of Pascal’s triangle. 

2. Use the binomial theorem to find the coefficient of x®y® in (x+y). 

3. Use the binomial theorem to find the coefficient of x® in (x +2)!°. 

4. Use the binomial theorem to find the coefficient of x®y? in (3x—2y)’. 

5. Use the binomial theorem to show wea) a2, 

6. Use Definition 3.2 (page 72) and Fact 1.3 (page 12) to show a) =2", 

7. Use the binomial theorem to show Y7_, 3*(?) =4”. 

8. Use Fact 3.3 (page 74) to derive Equation 3.2 (page 76). 

9. Use the binomial theorem to show (5) - (7) + (3) -(§) + (----+CD"(") =0. 
10. Show that the formula k(‘) = n(77}) is true for all integers n,k with O<k<n. 
11. Use the binomial theorem to show 9” = Y?_,(-1)*(7)10"-*. 

12. Show that (/)(7) = (7) (jm): 
13. Show that (3) = (3) +(3)+(3)+ @) +--+ ("9°)- 
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3.5 Inclusion-Exclusion 


Many counting problems involve computing the cardinality of a union AUB 
of two finite sets. We examine this kind of problem now. 

First we develop a formula for |AUB|. It is tempting to say that |AUB| 
must equal |A|+|B|, but that is not quite right. If we count the elements 
of A and then count the elements of B and add the two figures together, 
we get |A|+|B|. But if A and B have some elements in common, then we 
have counted each element in ANB twice. 


Therefore |A|+|B| exceeds |AUB| by |ANB|, and consequently |A U B| = 
|A|+|B|—|AnB|. This can be a useful equation. 


|A UB| =|A|+|B|—-|AnB| (3.3) 


Notice that the sets A, B and AnB are all generally smaller than AUB, so 
Equation (3.3) has the potential of reducing the problem of determining 
|A UB| to three simpler counting problems. It is sometimes called an 
inclusion-exclusion formula because elements in ANB are included (twice) 
in |A|+|B|, then excluded when |AnB| is subtracted. Notice that if ANB = @, 
then we do in fact get |A UB| =|A|+|B]; conversely if |A UB| =|A|+|BI, then 
it must be that ANB= @. 


Example 3.8 A 3-card hand is dealt off of a standard 52-card deck. How 


many different such hands are there for which all 3 cards are red or all 
three cards are face cards? 


Solution: Let A be the set of 3-card hands where all three cards are 
red (i.e., either Y or >). Let B be the set of 3-card hands in which all three 
cards are face cards (i.e., J,K or Q of any suit). These sets are illustrated 
below. 


A = \ sal sal ‘I =| Hil It. =) al ‘| (Red cards) 
W) iW) 9 iW) © © 9 


B= ‘ mal sal da eal Hil It. sal al a (Face cards) 
& iv) 9 iv) © & 9 


° 
3 


> 
° 
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We seek the number of 3-card hands that are all red or all face cards, 
and this number is |AUB|. By Formula (3.3), |AUB| = |A|+|B|-|An BI. 
Let’s examine |A|,|B| and |AnB| separately. Any hand in A is formed 
by selecting three cards from the 26 red cards in the deck, so |A| = leak 
Similarly, any hand in B is formed by selecting three cards from the 12 
face cards in the deck, so |B| = eee Now think about ANB. It contains all 
the 3-card hands made up of cards that are red face cards. 


A a: ‘all aa J . Ky Al Q eis al bea (Red face 
Vv) V) 9 fio fle > Pld Plo cards) 


The deck has only 6 red face cards, so |ANB| = (8). 

Now we can answer our question. The number of 3-card hands that 
are all red or all face cards is |AUB| =|A|+|B|—-|AnB|= (P)+ ()- (3) = 
2600 + 220 — 20 = 2800. 


d 


° 


There is an analogue to Equation (3.3) that involves three sets. Consider 
three sets A, B and C, as represented in the following Venn Diagram. 


ams 
CY 


Using the same kind of reasoning that resulted in Equation (3.3), you can 
convince yourself that 


|AUBUC|=|A|+|B]+|C|-|AnNB|-|ANC|-|BnNC|+|AnNBnC|. (3.4) 


There’s probably not much harm in ignoring this one for now, but if you 
find this kind of thing intriguing you should definitely take a course in 
combinatorics. (Ask your instructor!) 

As we’ve noted, Equation (3.3) becomes |A UB|=|A|+|B| if it happens 
that ANB=4@. Also, in Equation (3.4), note that if ANB=%, ANC=@ and 
BoC=@, we get the simple formula |A UBUC|=|A|+|B|+|C|. In general, 
we have the following formula for n sets, none of which overlap. It is 
sometimes called the addition principle. 


Fact 3.4 (Addition Principle) If A;,A29,...,A, are sets with A;nA;=@ 
whenever i 4 /, then |A, UAgU-:-UA,| = |Az|+|Ag]l+---+|Agl. 
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Example 3.9 How many 7-digit binary strings (0010100, 1101011, etc.) 
have an odd number of 1’s? 

Solution: Let A be the set of all 7-digit binary strings with an odd 
number of 1’s, so the answer to the question will be |A|. To compute |A|, 
we break A up into smaller parts. Notice any string in A will have either 
one, three, five or seven 1’s. Let A; be the set of 7-digit binary strings 
with only one 1. Let Ag be the set of 7-digit binary strings with three 1’s. 
Let As be the set of 7-digit binary strings with five 1’s, and let A7 be the 
set of 7-digit binary strings with seven 1’s. Therefore A = A;UA3UA5UAz7. 
Notice that any two of the sets A; have empty intersection, so Fact 3.4 
gives |A| =|A1|+|Ag|+|A5|+|Az7]. 

Now the problem is to find the values of the individual terms of this 
sum. For instance take A3, the set of 7-digit binary strings with three 1’s. 
Such a string can be formed by selecting three out of seven positions for 
the 1’s and putting 0’s in the other spaces. Therefore |A3| = Ge Similarly 
[Ail = (7), |As!= (J), and |Az|= (4). Finally the answer to our question is 
|Al = |Ai|+1Asl+lAsi+lAzl= (f+ (3+ (+ (7) = 7+35+21+1=64. There 
are 64 seven-digit binary strings with an odd number of 1’s. 


You may already have been using the Addition Principle intuitively, 
without thinking of it as a free-standing result. For instance, we used it 
in Example 3.2(c) when we divided lists into four types and computed the 
number of lists of each type. 


Exercises for Section 3.5 


1. At a certain university 523 of the seniors are history majors or math majors 
(or both). There are 100 senior math majors, and 33 seniors are majoring in 
both history and math. How many seniors are majoring in history? 

2. How many 4-digit positive integers are there for which there are no repeated 
digits, or for which there may be repeated digits, but all are odd? 

3. How many 4-digit positive integers are there that are even or contain no 0’s? 

4, This problem involves lists made from the letters T,H,E,O,R,Y, with repetition 
allowed. 

(a) How many 4-letter lists are there that don’t begin with T, or don’t end in 
Y? 

(b) How many 4-letter lists are there in which the sequence of letters T,H,E 
appears consecutively? 

(c) How many 5-letter lists are there in which the sequence of letters T.H,E 
appears consecutively? 
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10. 


. How many 7-digit binary strings begin in 1 or end in 1 or have exactly four 1’s? 
. Is the following statement true or false? Explain. If Aj; nN A2gnA3 = @, then 


|AyUAgUAg3/ = |Ai|/+|Ag|+]Agl. 


. This problem concerns 4-card hands dealt off of a standard 52-card deck. How 


many 4-card hands are there for which all 4 cards are of the same suit or all 4 
cards are red? 


. This problem concerns 4-card hands dealt off of a standard 52-card deck. How 


many 4-card hands are there for which all 4 cards are of different suits or all 4 
cards are red? 


. A 4-letter list is made from the letters L,I,S,T,E,D according to the following 


rule: Repetition is allowed, and the first two letters on the list are vowels or 
the list ends in D. How many such lists are possible? 


A 5-card poker hand is called a flush if all cards are the same suit. How many 
different flushes are there? 


Part Il 


How to Prove Conditional 
Statements 


CHAPTER 4 


Direct Proof 


t is time to prove some theorems. There are various strategies for doing 

this; we now examine the most straightforward approach, a technique 

called direct proof. As we begin, it is important to keep in mind the 
meanings of three key terms: Theorem, proof and definition. 

A theorem is a mathematical statement that is true and can be (and 
has been) verified as true. A proof of a theorem is a written verification 
that shows that the theorem is definitely and unequivocally true. A proof 
should be understandable and convincing to anyone who has the requisite 
background and knowledge. This knowledge includes an understanding of 
the meanings of the mathematical words, phrases and symbols that occur 
in the theorem and its proof. It is crucial that both the writer of the proof 
and the readers of the proof agree on the exact meanings of all the words, 
for otherwise there is an intolerable level of ambiguity. A definition is an 
exact, unambiguous explanation of the meaning of a mathematical word or 
phrase. We will elaborate on the terms theorem and definition in the next 
two sections, and then finally we will be ready to begin writing proofs. 


4.1 Theorems 


A theorem is a statement that is true and has been proved to be true. 
You have encountered many theorems in your mathematical education. 
Here are some theorems taken from an undergraduate calculus text. They 
will be familiar to you, though you may not have read all the proofs. 


Theorem: Let f be differentiable on an open interval J and let ce I. 
If f(c) is the maximum or minimum value of f on J, then f'(c) =0. 


Theorem: If 1° ,a; converges, then limp .,.a; = 0. 


Theorem: Suppose f is continuous on the interval [a,b]. Then f is 
integrable on [a,b]. 


Theorem: Every absolutely convergent series converges. 
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Observe that each of these theorems either has the conditional form “If 
P, then Q,” or can be put into that form. The first theorem has an initial 
sentence “Let f be differentiable on an open interval I, and let ce I,” which 
sets up some notation, but a conditional statement follows it. The third 
theorem has form “Suppose P. Then Q,” but this means the same thing 
as “If P, then Q.” The last theorem can be re-expressed as “If a series is 
absolutely convergent, then it is convergent.” 

A theorem of form “/f P, then Q,” can be regarded as a device that 
produces new information from P. Whenever we are dealing with a 
situation in which P is true, then the theorem guarantees that, in addition, 
Q is true. Since this kind of expansion of information is useful, theorems 
of form “If P, then Q,” are very common. 

But not every theorem is a conditional statement. Some have the form 
of the biconditional P = Q, but, as we know, that can be expressed as two 
conditional statements. Other theorems simply state facts about specific 
things. For example, here is another theorem from your study of calculus. 


Theorem: The series 1+ 5 + 5 + i + Z +--- diverges. 


It would be difficult (or at least awkward) to restate this as a conditional 
statement. Still, it is true that most theorems are conditional statements, 
so much of this book will concentrate on that type of theorem. 

It is important to be aware that there are a number of words that 
mean essentially the same thing as the word “theorem,” but are used in 
slightly different ways. In general the word “theorem” is reserved for a 
statement that is considered important or significant (the Pythagorean 
theorem, for example). A statement that is true but not as significant 
is sometimes called a proposition. A lemma is a theorem whose main 
purpose is to help prove another theorem. A corollary is a result that is 
an immediate consequence of a theorem or proposition. It is not important 
that you remember all these words now, for their meanings will become 
clear with usage. 

Our main task is to learn how to prove theorems. As the above examples 
suggest, proving theorems requires a clear understanding of the meaning 
of the conditional statement, and that is the primary reason we studied it 
so extensively in Chapter 2. In addition, it is also crucial to understand 
the role of definitions. 
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4.2 Definitions 


A proof of a theorem should be absolutely convincing. Ambiguity must be 
avoided. Everyone must agree on the exact meaning of each mathematical 
term. In Chapter 1 we defined the meanings of the sets N, Z, R, Q and 
, as well as the meanings of the symbols € and ¢, and we shall make 
frequent use of these things. Here is another definition that we use often. 


Definition 4.1 An integer n is even if n = 2a for some integer ae Z. 


Thus, for example, 10 is even because 10=2-5. Also, according to the 
definition, 7 is not even because there is no integer a for which 7 = 2a. 
While there would be nothing wrong with defining an integer to be odd if 
it’s not even, the following definition is more concrete. 


Definition 4.2 An integer n is odd if n =2a+1 for some integer ae Z. 


Thus 7 is odd because 7 = 2:3+1. We will use these definitions whenever 
the concept of even or odd numbers arises. If in a proof a certain number 
turns out to be even, the definition allows us to write it as 2a for an 
appropriate integer a. If some quantity has form 2b+1 where b is an 
integer, then the definition tells us the quantity is odd. 


Definition 4.3 Two integers have the same parity if they are both even 
or they are both odd. Otherwise they have opposite parity. 


Thus 5 and —17 have the same parity, as do 8 and 0; but 3 and 4 have 
opposite parity. 

Two points about definitions are in order. First, in this book the word 
or term being defined appears in boldface type. Second, it is common to 
express definitions as conditional statements even though the biconditional 
would more appropriately convey the meaning. Consider the definition 
of an even integer. You understand full well that if n is even then n = 2a 
(for a€ Z), and if n = 2a, then n is even. Thus, technically the definition 
should read “An integer n is even if and only if n = 2a for some aé€ Z.” 
However, it is an almost-universal convention that definitions are phrased 
in the conditional form, even though they are interpreted as being in the 
biconditional form. There is really no good reason for this, other than 
economy of words. It is the standard way of writing definitions, and we 
have to get used to it. 

Here is another definition that we will use often. 
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Definition 4.4 Suppose a and b are integers. We say that a divides b, 
written a|b, if b=ac for some ce Z. In this case we also say that a isa 
divisor of 6, and that } is a multiple of a. 


For example, 5 divides 15 because 15=5-3. We write this as 5| 15. 
Similarly 8 | 32 because 32 = 8-4, and —6|6 because 6 = —-6-—1. However, 
6 does not divide 9 because there is no integer c for which 9=6-c. We 
express this as 6/9, which we read as “6 does not divide 9.” 

Be careful of your interpretation of the symbols. There is a big difference 
between the expressions a|b and a/b. The expression a|b is a statement, 
while a/b is a fraction. For example, 8|16 is true and 8|20 is false. By 
contrast, 8/16=0.5 and 8/20=0.4 are numbers, not statements. Be careful 
not to write one when you mean the other. 

Every integer has a set of integers that divide it. For example, the set 
of divisors of 6 is {ae Z:a|6} = {-6,—-3,-2,-1,1,2,3,6}. The set of divisors 
of 5 is {-5,-1,1,5}. The set of divisors of 0 is Z. This brings us to the 
following definition, with which you are already familiar. 


Definition 4.5 A natural number n is prime if it has exactly two positive 
divisors, 1 and n. 


For example, 2 is prime, as are 5 and 17. The definition implies that 1 
is not prime, as it only has one (not two) positive divisor, namely 1. An 
integer n is composite if it factors as n =ab where a,b > 1. 


Definition 4.6 The greatest common divisor of integers a and }, 
denoted gcd(a,b), is the largest integer that divides both a and b. The 
least common multiple of non-zero integers a and b, denoted lcm(a, b), 
is smallest positive integer that is a multiple of both a and b. 


So ged(18,24)=6, gced(5,5) =5 and ged(32,-8)=8. Also ged(50, 18) = 2, 
but gced(50,9)= 1. Note that gcd(0,6) =6, because, although every integer 
divides 0, the largest divisor of 6 is 6. 

The expression gcd(0,0) is problematic. Every integer divides 0, so the 
only conclusion is that gcd(0,0) = 00. We circumvent this irregularity by 
simply agreeing to consider gcd(a,b) only when a and b are not both zero. 

Continuing our examples, lcm(4,6) = 12, and lcm(7,7) = 7. 

Of course not all terms can be defined. If every word in a definition were 
defined, there would be separate definitions for the words that appeared 
in those definitions, and so on, until the chain of defined terms became 
circular. Thus we accept some ideas as being so intuitively clear that 
they require no definitions or verifications. For example, we will not find 
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it necessary to define what an integer (or a real number) is. Nor will 
we define addition, multiplication, subtraction and division, though we 
will use these operations freely. We accept and use such things as the 
distributive and commutative properties of addition and multiplication, as 
well as other standard properties of arithmetic and algebra. 

As mentioned in Section 1.9, we accept as fact the natural ordering 
of the elements of N,Z,Q and R, so that (for example) statements such as 
“5 <7,” and “x < y implies —x >—y,” do not need to be justified. 

In addition, we accept the following fact without justification or proof. 


Fact 4.1 Suppose a and b are integers. Then: 
e atbeZ 

e a-beZ 

e abeZ 


These three statements can be combined. For example, we see that if a,b 
and ¢ are integers, then a2b-—ca+b is also an integer. 


We will also accept as obvious the fact that any integer a can be divided 
by a non-zero integer b, resulting in a unique quotient g and remainder r. 
For example, b = 8 goes into a=17 q=5 times with remainder r=2. In 
symbols, 17=4-5+2, ora=qb+r. This fact, called the division algorithm, 
was mentioned on page 28. 


(The Division Algorithm) Given integers a and 6 with b > 0, there exist 
unique integers g and r for which a=qb+r and 0<r<ob. 


Another fact that we will accept without proof (at least for now) is 
that every natural number greater than 1 has a unique factorization into 
primes. For example, the number 1176 can be factored into primes as 
1176 = 2-2-2-3-7-7 = 2°-3-77. By unique we mean that any factorization 
of 1176 into primes will have exactly the same factors (i.e., three 2’s, one 3 
and two 7’s). Thus, for example, there is no valid factorization of 1176 that 
has a factor of 5. You may be so used to factoring numbers into primes 
that it seems obvious that there cannot be different prime factorizations 
of the same number, but in fact this is a fundamental result whose proof 
is not transparent. Nonetheless, we will be content to assume that every 
natural number greater than 1 has a unique factorization into primes. 
(We will revisit the issue of a proof in Section 10.2.) 


We will introduce other accepted facts, as well as definitions, as needed. 
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4.3 Direct Proof 


This section explains a simple way to prove theorems or propositions 
that have the form of conditional statements. The technique is called 
direct proof. To simplify the discussion, our first examples will involve 
proving statements that are almost obviously true. Thus we will call the 
statements propositions rather than theorems. (Remember, a proposition 
is a statement that, although true, is not as significant as a theorem.) 

To understand how the technique of direct proof works, suppose we 
have some proposition of the following form. 


Proposition If P, then Q. 


This proposition is a conditional statement of form P > Q. Our goal 
is to show that this conditional statement is true. To see how to proceed, 
look at the truth table. 


P|Q|P>Q 
T|T T 
T|F F 
F\T T 
F\F T 


The table shows that if P is false, the statement P > Q is automatically 
true. This means that if we are concerned with showing P > Q is true, we 
don’t have to worry about the situations where P is false (as in the last 
two lines of the table) because the statement P > Q will be automatically 
true in those cases. But we must be very careful about the situations 
where P is true (as in the first two lines of the table). We must show that 
the condition of P being true forces Q to be true also, for that means the 
second line of the table cannot happen. 

This gives a fundamental outline for proving statements of the form 
P=>Q. Begin by assuming that P is true (remember, we don’t need to worry 
about P being false) and show this forces Q to be true. We summarize this 
as follows. 


Outline for Direct Proof 
Proposition If P, then Q. 


Proof. Suppose P. 


Therefore Q. | 


Direct Proof 91 


So the setup for direct proof is remarkably simple. The first line 
of the proof is the sentence “Suppose P.” The last line is the sentence 
“Therefore Q.” Between the first and last line we use logic, definitions and 
standard math facts to transform the statement P to the statement Q. It 
is common to use the word “Proof” to indicate the beginning of a proof, 
and the symbol m to indicate the end. 

As our first example, let’s prove that if x is odd then x? is also odd. 
(Granted, this is not a terribly impressive result, but we will move on to 
more significant things in due time.) The first step in the proof is to fill 
in the outline for direct proof. This is a lot like painting a picture, where 
the basic structure is sketched in first. We leave some space between the 
first and last line of the proof. The following series of frames indicates the 
steps you might take to fill in this space with a logical chain of reasoning. 


Proposition If x is odd, then x? is odd. 


Proof. Suppose x is odd. 


Therefore x2 is odd. | 


Now that we have written the first and last lines, we need to fill in the 
space with a chain of reasoning that shows that x being odd forces x? to 
be odd. 

In doing this it’s always advisable to use any definitions that apply. 
The first line says x is odd, and by Definition 4.2 it must be that x =2a+1 
for some a€ Z, so we write this in as our second line. 


Proposition If x is odd, then x? is odd. 


Proof. Suppose x is odd. 
Then x =2a+1 for some aé€ Z, by definition of an odd number. 


Therefore x2 is odd. | 


Now jump down to the last line, which says x? is odd. Think about what 
the line immediately above it would have to be in order for us to conclude 
that x” is odd. By the definition of an odd number, we would have to have 
x? =2a+1 for some a€ Z. However, the symbol a now appears earlier in 
the proof in a different context, so we should use a different symbol, say 0. 
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Proposition If x is odd, then x? is odd. 


Proof. Suppose x is odd. 
Then x = 2a+1 for some aé Z, by definition of an odd number. 


Thus x? =2b+1 for an integer b. 
Therefore x? is odd, by definition of an odd number. a 


We are almost there. We can bridge the gap as follows. 


Proposition If x is odd, then x? is odd. 


Proof. Suppose x is odd. 

Then x =2a+1 for some aé Z, by definition of an odd number. 
Thus x” = (2a +1)" = 4a? + 4a +1 = 2(2a? + 2a)+1. 

So x? =2b+1 where b is the integer b = 2a? + 2a. 

Thus x? = 2b+1 for an integer b. 

Therefore x? is odd, by definition of an odd number. a 


Finally, we may wish to clean up our work and write the proof in paragraph 
form. Here is our final version. 


Proposition If x is odd, then x? is odd. 


Proof. Suppose x is odd. Then x = 2a+1 for some a € Z, by definition 
of an odd number. Thus x? = (2a+ 1)? = 4a7+4a+1 = 2(2a7+2a)+1, so 
x? =2b+1 where b = 2a7+2a€Z. Therefore x” is odd, by definition 
of an odd number. a 


At least initially, it’s generally a good idea to write the first and last line 
of your proof first, and then fill in the gap, sometimes jumping alternately 
between top and bottom until you meet in the middle, as we did above. This 
way you are constantly reminded that you are aiming for the statement 
at the bottom. Sometimes you will leave too much space, sometimes not 
enough. Sometimes you will get stuck before figuring out what to do. This 
is normal. Mathematicians do scratch work just as artists do sketches for 
their paintings. 
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Here is another example. Consider proving following proposition. 
Proposition Let a,b and c be integers. If a|b and b|c, then a|c. 


Let’s apply the basic outline for direct proof. To clarify the procedure 
we will write the proof in stages again. 


Proposition Let a,b and c be integers. If a|b and b|c, then a|c. 


Proof. Suppose a|b and b|c. 


Therefore a|c. a 


Our first step is to apply Definition 4.4 to the first line. The definition 
says a|b means b =ac for some integer c, but since c already appears in 
a different context on the first line, we must use a different letter, say d. 
Similarly let’s use a new letter e in the definition of b|c. 


Proposition Let a,b and c be integers. If a|b and b|c, then a|c. 


Proof. Suppose a|b and b|c. 
By Definition 4.4, we know a|b means there is an integer d with b= ad. 
Likewise, b|c means there is an integer e for which c= be. 


Therefore a|c. a 


We have almost bridged the gap. The line immediately above the last line 
should show that a|c. According to Definition 4.4, this line should say 
that c=ax for some integer x. We can get this equation from the lines at 
the top, as follows. 


Proposition Let a,b and c be integers. If a|b and b|c, then a|c. 


Proof. Suppose a|b and b|c. 

By Definition 4.4, we know a|b means there is an integer d with b =ad. 
Likewise, b|c means there is an integer e for which c = be. 

Thus c = be = (ad)e =a(de), so c=ax for the integer x =de. 

Therefore a|c. a 


The next example is presented all at once rather than in stages. 
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Proposition If x is an even integer, then x?—6x+5 is odd. 


Proof. Suppose x is an even integer. 

Then x = 2a for some a€ Z, by definition of an even integer. 

So x?-6x+5 = (2a)*-6(2a)+5 = 4a7-12a+5 = 4a?-12a+4+1 = 2(2a?—6a+2)+1. 
Therefore we have x? -6x+5=2b+1, where b = 2a”-6a+2€Z. 
Consequently x«?-—6x+5 is odd, by definition of an odd number. a 


One doesn’t normally use a separate line for each sentence in a proof, 
but for clarity we will often do this in the first few chapters of this book. 

Our next example illustrates a standard technique for showing two 
quantities are equal. If we can show m <n and n<™m then it follows that 
m=n. In general, the reasoning involved in showing m <n can be quite 
different from that of showing n<m. 

Recall Definition 4.6 of a least common multiple on page 88. 


Proposition If a,b,ceN, then lem(ca,cb)=c-lem(a, bd). 


Proof. Assume a,b,ceN. Let m=Ilem(ca,cb) and n=c-lem(a,b). We will 
show m=n. By definition, lem(a,b) is a multiple of both a and b, so 
Iem(a,b) = ax = by for some x,ye€ Z. From this we see that n= c-lem(a,b) = 
cax =cby is a multiple of both ca and cb. But m = lem(ca,cb) is the smallest 
multiple of both ca and cb. Thus m <n. 

On the other hand, as m =lcm(ca,cb) is a multiple of both ca and cb, 
we have m=cax=cby for some x,ye€ Z. Then tm =ax=by is a multiple of 
both a and b. Therefore lcm(a,b) < im, so c:lem(a,b)<m, that is, n<m. 

We’ve shown m<n and n<m, so m=n. The proof is complete. a 


The examples we’ve looked at so far have all been proofs of statements 
about integers. In our next example, we are going to prove that if x and y 
are positive real numbers for which x < y, then x < \/y. You may feel that 
the proof is not as “automatic” as the proofs we have done so far. Finding 
the right steps in a proof can be challenging, and that is part of the fun. 


Proposition Let x and y be positive numbers. If x<y, then /x< \/y. 


Proof. Suppose x < y. Subtracting y from both sides gives x— y <0. 

This can be written as \/x”— Jy <0. 

Factor this to get (\/x- /y/x+ /y) <0. 

Dividing both sides by the positive number /x+ ,/y produces \/x- \/y <0. 
Adding ,/y to both sides gives x < \/y. a 


Direct Proof 95 


This proposition tells us that whenever x < y, we can take the square 
root of both sides and be assured that \/x < \/y. This can be useful, as we 
will see in our next proposition. 

That proposition will concern the expression 2\/xy <x+y. Notice when 
you substitute random positive values for the variables, the expression is 
true. For example, for x=6 and y=4, the left side is 2V6-4=4vV6 ~ 9.79, 
which is less than the right side 6+4=10. Is it true that 2,\/xy<x+y for 
any positive x and y? How could we prove it? 

To see how, let’s first cast this into the form of a conditional statement: 
If x and y are positive real numbers, then 2,/xy<x+y. The proof begins 
with the assumption that x and y are positive, and ends with 2,/xy<x+y. 
In mapping out a strategy, it can be helpful to work backwards, working 
from 2,/xy <x+y to something that is obviously true. Then the steps can 
be reversed in the proof. In this case, squaring both sides of 2,\/xy<sx+y 
gives us 

Axy s x + 2xy+ y?. 


Now subtract 4xy from both sides and factor. 


x7 — Ixy + y? 


IA 


< (x — y)? 


But this last line is clearly true, since the square of x—y cannot be negative! 
This gives us a strategy for the proof, which follows. 


Proposition If x and y are positive real numbers, then 2,/xy<x+y. 


Proof. Suppose x and y are positive real numbers. 

Then 0<(x-y)?, that is, 0<x?-2xy+y?. 

Adding 4xy to both sides gives 4xy <x? +2xy+y?. 

Factoring yields 4xy <(x+)?. 

Previously we proved that such an inequality still holds after taking the 
square root of both sides; doing so produces 2,/xy<x+y. a 


Notice that in the last step of the proof we took the square root of both 
sides of 4xy < (x+y)? and got \/4xy < V/(x+ y)?, and the fact that this did 
not reverse the symbol < followed from our previous proposition. This is 
an important point. Sometimes the proof of a proposition or theorem uses 
another proposition or theorem (that has already been proved). 
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4.4 Using Cases 


In proving a statement is true, we sometimes have to examine multiple 
cases before showing the statement is true in all possible scenarios. This 
section illustrates a few examples. 

Our examples will concern the expression 1+(-1)"(2n-1). Here isa 
table that shows what we get when plugging in various integers for n. 
Notice that 1+(-1)"(2n -1) is a multiple of 4 in every line. 


1+(-1)"(2n-1) 


Ps) 


anak wd 
(oe) 


Is 1+(-1)"(2n-1) always a multiple of 4? We prove the answer is “yes” 
in our next example. Notice, however, that the expression 1+(-—1)"(2n - 1) 
behaves differently depending on whether n is even or odd, for in the first 
case (—1)” = 1, and in the second (-1)” = —-1. Thus the proof must examine 
these two possibilities separately. 


Proposition If neéN, then 1+(-1)"(2n-1) is a multiple of 4. 


Proof. Suppose neN. 
Then n is either even or odd. Let’s consider these two cases separately. 


Case 1. Suppose n is even. Then n = 2k for some k € Z, and (—1)” = 1. 
Thus 1+(-1)"(2n -1) =1+(1)(2-2k-1) = 4k, which is a multiple of 4. 
Case 2. Suppose n is odd. Then n=2k+1 for some k€ Z, and (-1)” =-1. 
Thus 1+(-1)"(2n -1) = 1-(2(2k+1)-1) = —4k, which is a multiple of 4. 
These cases show that 1+(-1)"(2n -1) is always a multiple of 4. a 


Now let’s examine the flip side of the question. We just proved that 
1+(-1)"(2n—-1) is always a multiple of 4, but can we get every multiple of 4 
this way? The following proposition and proof give an affirmative answer. 
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Proposition Every multiple of 4 has form 1+(-—1)"(2n-1) for some neN. 


Proof. In conditional form, the proposition is as follows: 
If k is a multiple of 4, then there is an néN for which 1+(-1)"(2n-1)=k&. 
What follows is a proof of this conditional statement. 
Suppose & is a multiple of 4. 
This means k = 4a for some integer a. 
We must produce an n€N for which 1+(-1)"(2n-1)=k. 
This is done by cases, depending on whether a is zero, positive or negative. 
Case 1. Suppose a=0. Let n=1. Then 1+(-1)?(2n—-1)=1+4+(-1)1(2-1)=0 
= 4-0 =4a=k. 
Case 2. Suppose a >0. Let n = 2a, which is in N because a is positive. Also 
n is even, so (-1)” =1. Thus 1+(—1)"(2n-1) = 1+(2n-1) = 2n = 2(2a) = 4a =k. 
Case 3. Suppose a <0. Let n =1-2a, which is an element of N because 
a is negative, making 1-2a positive. Also n is odd, so (-1)” =-1. Thus 
1+(-1)"(2n-1) = 1-(2n-1)= 1-(2(1- 2a)- 1) =4a=k. 

The above cases show that no matter whether a multiple 2 = 4a of 4 is 
zero, positive or negative, k = 1+(-1)"(2n-1) for some neEN. | 


4.5 Treating Similar Cases 


Occasionally two or more cases in a proof will be so similar that writing 
them separately seems tedious or unnecessary. Here is an example. 


Proposition If two integers have opposite parity, then their sum is odd. 


Proof. Suppose m and n are two integers with opposite parity. 
We need to show that m+n is odd. This is done in two cases, as follows. 
Case 1. Suppose m is even and n is odd. Thus m= 2a and n= 2b+1 for 
some integers a and b. Therefore m+n =2a+2b+1=2(a+6)+1, which is 
odd (by Definition 4.2). 
Case 2. Suppose m is odd and n is even. Thus m= 2a+1 and n= 26 for 
some integers a and b. Therefore m+n =2a+1+2b=2(a+6)+1, which is 
odd (by Definition 4.2). 

In either case, m+n is odd. a 


The two cases in this proof are entirely alike except for the order in 
which the even and odd terms occur. It is entirely appropriate to just do 
one case and indicate that the other case is nearly identical. The phrase 
“Without loss of generality...” is a common way of signaling that the proof is 
treating just one of several nearly identical cases. Here is a second version 
of the above example. 
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Proposition If two integers have opposite parity, then their sum is odd. 


Proof. Suppose m and n are two integers with opposite parity. 

We need to show that m+n is odd. 

Without loss of generality, suppose m is even and n is odd. 

Thus m = 2a and n=206+1 for some integers a and bd. 

Therefore m+n = 2a+2b+1=2(a+b)+1, which is odd (by Definition 4.2). 


In reading proofs in other texts, you may sometimes see the phrase 
“Without loss of generality” abbreviated as “WLOG.” However, in the 
interest of transparency we will avoid writing it this way. In a similar 
spirit, it is advisable—at least until you become more experienced in proof 
writing—that you write out all cases, no matter how similar they appear 
to be. 

Please check your understanding by doing the following exercises. The 
odd numbered problems have complete proofs in the Solutions section in 
the back of the text. 


Exercises for Chapter 4 


Use the method of direct proof to prove the following statements. 
. If x is an even integer, then x? is even. 

. If x is an odd integer, then x° is odd. 

If a is an odd integer, then a? +3a+5 is odd. 

Suppose x,ye Z. If x and y are odd, then xy is odd. 

. Suppose x,ye Z. If x is even, then xy is even. 

. Suppose a,b,ce Z. Ifa|b and a|c, then a|(b+c). 

. Suppose a,be Z. If a|b, then a?|d?. 

. Suppose a is an integer. If 5|2a, then 5|a. 


CoONaapwnd 


. Suppose a is an integer. If 7|4a, then 7|a. 

. Suppose a and 6 are integers. If a|b, then a| (3b? — 67 +50). 
. Suppose a,b,c,déZ. If a|b and c|d, then ac | bd. 

. IfxeR and 0<x<4, then 45 21. 

. Suppose x,yeER. If x7 +5y = y?2+5x, then x=y or x+y=5. 

. IfneZ, then 5n?+3n+7 is odd. (Try cases.) 


. IfneZ, then n?+3n+4 is even. (Try cases.) 


— it 
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. If two integers have the same parity, then their sum is even. (Try cases.) 


—_ 
a] 


. If two integers have opposite parity, then their product is even. 


pe 
fe.) 


. Suppose x and y are positive real numbers. If x < y, then x? < y?. 
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19. 
20. 
21. 
22. 
23. 
24. 


25. 
26. 
27. 
28. 


Suppose a,b and c are integers. If a?|b and b?|c, then a®|c. 

If a is an integer and a? |a, then ae {-1,0,1}. 

If p is prime and & is an integer for which 0<k < p, then p divides (?). 
If nN, then n? = 2(5)+({). (You may need a separate case for n= 1.) 
If neN, then (°”) is even. 


If ne N and n=2, then the numbers n!+2, n!+3, n!+4, n!+5,..., n!+n are all 
composite. (Thus for any n= 2, one can find n consecutive composite numbers. 
This means there are arbitrarily large “gaps” between prime numbers.) 


If a,b,c€ N and c<b<a, then O°) = (2 2). 
Every odd integer is a difference of two squares. (Example 7 = 4? — 3?, etc.) 
Suppose a,beN. If ged(a,b)>1 then, a| bd or b is not prime. 


If a,b,c eZ, then c-ged(a, b) < ged(ca,cb). 


CHAPTER 5 


Contrapositive Proof 


W:’ now examine an alternative to direct proof called contrapositive 
proof. Like direct proof, the technique of contrapositive proof is 
used to prove conditional statements of the form “If P, then Q.” Although 
it is possible to use direct proof exclusively, there are occasions where 
contrapositive proof is much easier. 


5.1 Contrapositive Proof 


To understand how contrapositive proof works, imagine that you need to 
prove a proposition of the following form. 


Proposition If P, then Q. 


This is a conditional statement of form P >Q. Our goal is to show 
that this conditional statement is true. Recall that in Section 2.6 we 
observed that P > Q is logically equivalent to ~Q =>~ P. For convenience, 
we duplicate the truth table that verifies this fact. 


P/I|Q|~Q/|~P | P>Q|~Q>~P 
T|\|T) F F T T 
T\|F | T F F F 
F\T| F T T T 
F\F| T T T T 


According to the table, statements P>Q and ~Q=>~P are different 
ways of expressing exactly the same thing. The expression ~ Q >~ P is 
called the contrapositive form of P > qQ.! 


1Do not confuse the words contrapositive and converse. Recall from Section 2.4 that the 
converse of P > Q is the statement Q = P, which is not logically equivalent to P>Q. 
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Since P > Q is logically equivalent to ~ Q >~ P, it follows that to prove 
P=4Q is true, it suffices to instead prove that ~ Q >~ P is true. If we were 
to use direct proof to show ~ Q >~ P is true, we would assume ~ Q is true 
use this to deduce that ~ P is true. This in fact is the basic approach of 
contrapositive proof, summarized as follows. 


Outline for Contrapositive Proof 


Proposition If P, then Q. 


Proof. Suppose ~ Q. 


Therefore ~ P. | 


So the setup for contrapositive proof is very simple. The first line of the 
proof is the sentence “Suppose Q is not true.” (Or something to that effect.) 
The last line is the sentence “Therefore P is not true.” Between the first 
and last line we use logic and definitions to transform the statement ~ Q 
to the statement ~ P. 

To illustrate this new technique, and to contrast it with direct proof, 
we now prove a proposition in two ways: first with direct proof and then 
with contrapositive proof. 


Proposition Suppose xéZ. If 7x+9 is even, then x is odd. 


Proof. (Direct) Suppose 7x +9 is even. 

Thus 7x +9 = 2a for some integer a. 

Subtracting 6x+9 from both sides, we get x = 2a-—6x-9. 

Thus x = 2a-6x-—9=2a-6x-10+1=2(a-—3x-5)+1. 

Consequently x = 2b+1, where b=a-3x-5€Z. 

Therefore x is odd. a 


Here is a contrapositive proof of the same statement: 


Proposition Suppose xéZ. If 7x+9 is even, then x is odd. 


Proof. (Contrapositive) Suppose x is not odd. 

Thus x is even, so x = 2a for some integer a. 

Then 7x +9 = 7(2a)+9=14a+8+1=2(7a+4)+1. 

Therefore 7x+9=2b+1, where b is the integer 7a+4. 

Consequently 7x +9 is odd. 

Therefore 7x+9 is not even. a 
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Though the proofs are of equal length, you may feel that the con- 
trapositive proof flowed more smoothly. This is because it is easier to 
transform information about x into information about 7x+9 than the other 
way around. For our next example, consider the following proposition 
concerning an integer x: 


Proposition If x?-—6x+5 is even, then x is odd. 


A direct proof would be problematic. We would begin by assuming that 
x*—6x+5 is even, so x?—6x+5=2a. Then we would need to transform this 
into x= 2b+1 for be Z. But it is not quite clear how that could be done, 
for it would involve isolating an x from the quadratic expression. However 
the proof becomes very simple if we use contrapositive proof. 


Proposition Suppose x¢ Z. If x?-6x+5 is even, then x is odd. 


Proof. (Contrapositive) Suppose x is not odd. 

Thus x is even, so x = 2a for some integer a. 

So x?-6x+5 = (2a)*-6(2a)+5 = 4a2-12a+5 = 4a7-12a+4+1 = 2(2a?—-6a+2)+1. 
Therefore x? —6x+5=2b+1, where b is the integer 2a? —6a +2. 
Consequently x?-—6x+5 is odd. 

Therefore x? —6x+5 is not even. | 


In summary, since x being not odd (~ Q) resulted in x?-—6x+5 being not 
even (~ P), then x?—6x+5 being even (P) means that x is odd (Q). Thus 
we have proved P > Q by proving ~Q =>~ P. Here is another example: 


Proposition Suppose x,yeéR. If y?+ yx? <x°+xy?, then y<x. 


Proof. (Contrapositive) Suppose it is not true that y<x, so y>x. 
Then y—x>0. Multiply both sides of y—x>0 by the positive value x? + y?. 


(y—x\(x? +7) > O(n? + y?) 
yx? + y? — x3 — xy? > 0 


3 


y? + yx? > x +xy" 


Therefore y? + yx? >x?+xy?, so it is not true that y?+ yx? <x? 4+xy?. a 


Proving “If P, then Q,” with the contrapositive approach necessarily 
involves the negated statements ~ P and ~Q. In working with these we 
may have to use the techniques for negating statements (e.g., DeMorgan’s 
laws) discussed in Section 2.10. We consider such an example next. 
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Proposition Suppose x,yeZ. If 5{xy, then 5x and 5/y. 


Proof. (Contrapositive) Suppose it is not true that 5{x and 5/,y. 

By DeMorgan’s law, it is not true that 5{x or it is not true that 5{y. 
Therefore 5|x or 5|y. We consider these possibilities separately. 

Case 1. Suppose 5|x. Then x=5a for some aé Z. 

From this we get xy = 5(ay), and that means 5 | xy. 

Case 2. Suppose 5|y. Then y=5a for some ae Z. 

From this we get xy = 5(ax), and that means 5|xy. 

The above cases show that 5|xy, so it is not true that 5{ xy. a 


5.2 Congruence of Integers 


This is a good time to introduce a new definition. It is not necessarily 
related to contrapositive proof, but introducing it now ensures that we 
have a sufficient variety of exercises to practice all our proof techniques on. 
This new definition occurs in many branches of mathematics, and it will 
surely play a role in some of your later courses. But our primary reason 
for introducing it is that it will give us more practice in writing proofs. 


Definition 5.1 Given integers a and b and an neEN, we say that a and b 
are congruent modulo n if n|(a—0). We express this as a=b (mod n). 
If a and 6 are not congruent modulo n, we write this as a 4b (mod n). 
Example 5.1 Here are some examples: 

1. 9=1 (mod 4) because 4| (9-1). 

. 6=10 (mod 4) because 4] (6-10). 

. 1448 (mod 4) because 4{(14-8). 

. 20=4 (mod 8) because 8 | (20-4). 

. 17=-4 (mod 8) because 3 | (17 -(—4)). 


ao fF WwW DY 


In practical terms, a= 6 (mod n) means that a and b have the same 
remainder when divided by n. For example, we saw above that 6 = 10 
(mod 4) and indeed 6 and 10 both have remainder 2 when divided by 4. 
Also we saw 14#8 (mod 4), and sure enough 14 has remainder 2 when 
divided by 4, while 8 has remainder 0. 

To see that this is true in general, note that if a and b both have the 
same remainder r when divided by n, then it follows that a=kn+r and 
b=¢n+r for some k,@e€Z. Then a-—b= (kn+r)—-(€n+r)= n(k—-2@). But 
a-—b=n(k-@) means n|(a-—b), so a=b (mod n). Conversely, one of the 
exercises for this chapter asks you to show that if a=6 (mod n), then a 
and b have the same remainder when divided by n. 
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We conclude this section with a couple of proofs involving congruence of 
integers, but you will also test your skills with other proofs in the exercises. 


Proposition Let a,b¢ Zand neN. If a=b (mod n), then a? = b? (mod n). 


Proof. We will use direct proof. Suppose a = b (mod n). 

By definition of congruence of integers, this means n |(a—}b). 

Then by definition of divisibility, there is an integer c for which a—b= nc. 
Now multiply both sides of this equation by a+b. 


a-b = ne 
(a—b)\(a+b) = nec(at+b) 
a*-b? = nc(atb) 


Since c(a+b)€ Z, the above equation tells us | (a? —b?). 
According to Definition 5.1, this gives a? = b? (mod n). | 


Let’s pause to consider this proposition’s meaning. It says a = b (mod n) 
implies a? = 6? (mod n). In other words, it says that if integers a and b 
have the same remainder when divided by n, then a? and b? also have 
the same remainder when divided by n. As an example of this, 6 and 10 
have the same remainder (2) when divided by n = 4, and their squares 
36 and 100 also have the same remainder (0) when divided by n= 4. The 
proposition promises this will happen for all a, b and n. In our examples 
we tend to concentrate more on how to prove propositions than on what 
the propositions mean. This is reasonable since our main goal is to learn 
how to prove statements. But it is helpful to sometimes also think about 
the meaning of what we prove. 


Proposition Leta,b,ce ZandneN. Ifa=b (mod n), then ac = bc (mod n). 


Proof. We employ direct proof. Suppose a= 6b (mod n). By Definition 5.1, 
it follows that n|(a—b). Therefore, by definition of divisibility, there exists 
an integer k for which a—b=nk. Multiply both sides of this equation 
by c to get ac—bc=nkc. Thus ac— bc =nt(kc) where kc € Z, which means 
n|(ac— bc). By Definition 5.1, we have ac = bc (mod n). a 


Contrapositive proof seems to be the best approach in the next example, 
since it will eliminate the symbols { and #. 
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Proposition Suppose a,b¢Z and neN. If 12a #12b (mod n), then n{12. 


Proof. (Contrapositive) Suppose n|12, so there is an integer c for which 
12=nc. Now reason as follows. 


12 = ne 
12(a—b) = nc(a—b) 
12a-12b = n(ca-cb) 


Since ca—cbe Z, the equation 12a —12b = n(ca-—cb) implies n | (12a — 126). 
This in turn means 12a = 126 (mod n). a 


5.3 Mathematical Writing 


Now that you have begun writing proofs, it is the right time to address 
issues concerning writing. Unlike logic and mathematics, where there 
is a clear-cut distinction between what is right or wrong, the difference 
between good and bad writing is sometimes a matter of opinion. But there 
are some standard guidelines that will make your writing clearer. Some 
of these are listed below. 


1. Never begin a sentence with a mathematical symbol. The reason 
is that sentences begin with capital letters, but mathematical symbols 
are case sensitive. Since x and X can have entirely different meanings, 
putting such symbols at the beginning of a sentence can lead to ambiguity. 
Following are some examples of bad usage (marked with x) and good 
usage (marked with V). 


A is a subset of B. x 
The set A is a subset of B. v 
x is an integer, so 2x+5 is an integer. x 
Since x is an integer, 2x+5 is an integer. v 
x? —x+2=0 has two solutions. x 
X?-—x+2=0 has two solutions. x (and silly too) 
The equation x? —x+2=0 has two solutions. rid 


2. End each sentence with a period. Do this even when the sentence 
ends with a spa oo or expression. 


Kuler proved that) ~=[[—>; x 
=1 a peP te a 
1 
Kuler proved that ae -= ][ —;. v 
= FR pep l- my 
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Mathematical statements (equations, etc.) are like English phrases that 
happen to contain special symbols, so use normal punctuation. 

3. Separate mathematical symbols and expressions with words. 
Failure to do this can cause confusion by making distinct expressions 
appear to merge into one. Compare the clarity of the following examples. 


Because x?-1=0, x=lorx=-1. x 
Because x? -—1=0, it follows that x=1 or x=—1. ee 
Unlike AUB, ANB equals @. x 
Unlike AUB, the set ANB equals @. v 


4. Avoid misuse of symbols. Symbols such as =, <, ¢, €, etc., are not 
words. While it is appropriate to use them in mathematical expressions, 
they are out of place in other contexts. 


Since the two sets are =, one is a subset of the other. x 
Since the two sets are equal, one is a subset of the other. JV 
The empty set is a ¢ of every set. x 
The empty set is a subset of every set. v 
Since a is odd and x odd = x? odd, a? is odd. x 


Since a is odd and any odd number squared is odd, then a? is odd.V 


5. Avoid using unnecessary symbols. Mathematics is confusing enough 
without them. Don’t muddy the water even more. 


No set X has negative cardinality. x 
No set has negative cardinality. JV 


6. Use the first person plural. In mathematical writing, it is common 
to use the words “we” and “us” rather than “I,” “you” or “me.” It is as if 
the reader and writer are having a conversation, with the writer guiding 
the reader through the details of the proof. 

7. Use the active voice. This is just a suggestion, but the active voice 
makes your writing more lively. 


The value x = 3 is obtained through the division of both sides by 5.x 
Dividing both sides by 5, we get the value x =8. v 


8. Explain each new symbol. In writing a proof, you must explain the 
meaning of every new symbol you introduce. Failure to do this can lead 
to ambiguity, misunderstanding and mistakes. For example, consider 
the following two possibilities for a sentence in a proof, where a and b 
have been introduced on a previous line. 
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Since a|b, it follows that b=ac. x 
Since a|b, it follows that b=ac for some integer c. v 


If you use the first form, then a reader who has been carefully following 
your proof may momentarily scan backwards looking for where the c 
entered into the picture, not realizing at first that it came from the 
definition of divides. 


9. Watch out for “it.” The pronoun “it” can cause confusion when it is 
unclear what it refers to. If there is any possibility of confusion, you 
should avoid the word “it.” Here is an example: 


Since X CY, and 0< |X|, we see that it is not empty. x 
Is “it” X or Y? Either one would make sense, but which do we mean? 
Since X CY, and 0< |X|, we see that Y is not empty. v 


10. Since, because, as for, so. In proofs, it is common to use these 
words as conjunctions joining two statements, and meaning that one 
statement is true and as a consequence the other true. The following 
statements all mean that P is true (or assumed to be true) and as a 
consequence Q is true also. 


Q since P Q because P Q,asP Q, for P P,soQ 
Since P, Q Because P, Q as P,Q 


Notice that the meaning of these constructions is different from that of 
“If P, then Q,” for they are asserting not only that P implies Q, but also 
that P is true. Exercise care in using them. It must be the case that P 
and Q are both statements and that Q really does follow from P. 


xeEN, so Z x 
xeEN, soxeZ v 


11. Thus, hence, therefore consequently. These adverbs precede a 
statement that follows logically from previous sentences or clauses. Be 
sure that a statement follows them. 


Therefore 2k +1. x 
Therefore a = 2k +1. v 
Your mathematical writing will get better with practice. One of the 


best ways to develop a good mathematical writing style is to read other 
people’s proofs. Adopt what works and avoid what doesn’t. 
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Exercises for Chapter 5 


A. Use the method of contrapositive proof to prove the following statements. (In 
each case you should also think about how a direct proof would work. You will 
find in most cases that contrapositive is easier.) 

. Suppose ne Z. If n? is even, then n is even. 

. Suppose ne Z. If n? is odd, then n is odd. 

Suppose a,b « Z. If a2(b? — 2b) is odd, then a and b are odd. 

Suppose a,b,ce Z. If a does not divide bc, then a does not divide b. 

Suppose xe€R. If x«7+5x <0 then x<0. 

Suppose «eR. If x?-x>0 then x>-1. 

. Suppose a,be Z. If both ab and a+b are even, then both a and b are even. 

. Suppose «ER. If x°—4x4+3x3—x?+3x-420, then x>=0. 

. Suppose ne Z. If 3{n?, then 3{n. 

10. Suppose x,y,z¢Z and x40. If x{yz, then x{y and xfz. 


CHNAAPR wD 


11. Suppose x,y eZ. If x?(y+3) is even, then x is even or y is odd. 
12. Suppose a Z. If a? is not divisible by 4, then a is odd. 


4 


13. Suppose xe R. If x°+ 7x? +5x>«4+x2+8, then x>0. 


B. Prove the following statements using either direct or contrapositive proof. 
Sometimes one approach will be much easier than the other. 
14. If a,b€ Z and a and b have the same parity, then 3a+7 and 7b—4 do not. 
15. Suppose x¢ Z. If x?—1 is even, then x is odd. 
16. Suppose xe Z. If x+y is even, then x and y have the same parity. 
17. If n is odd, then 8|(n?-1). 
18. For any a,b€ Z, it follows that (a+b)? =a? +b? (mod 3). 
19. Let a,beZ andneN. If a=b (mod n) and a=c (mod n), then c=b (mod n). 
20. If ae Z and a=1 (mod 5), then a? =1 (mod 5). 
21. Let a,b€Z and neN. If a=b (mod n), then a® = b® (mod n) 
22. Letae Z,neN. Ifa has remainder r when divided by n, then a=r (mod n). 
23. Let a,b,ceZ and neN. If a=b (mod n), then ca=cb (mod n). 
24. If a=b (mod n) and c=d (mod n), then ac= bd (mod n). 
25. If neN and 2” —1 is prime, then n is prime. 
26. If n=2" -1 for KEN, then every entry in Row n of Pascal’s Triangle is odd. 
27. If a=0 (mod 4) or a=1 (mod 4), then ($) is even. 
28. If ne Z, then 4}(n?-3). 
29. If integers a and b are not both zero, then gcd(a,b) = ged(a — b,b). 
30. If a=b(mod n), then ged(a,n) = ged(b,n). 
31. Suppose the division algorithm applied to a and b yields a=qb+r. Then 
gced(a,b) = ged(r,d). 


CHAPTER 6 


Proof by Contradiction 


e now explore a third method of proof: proof by contradiction. 

This method is not limited to proving just conditional statements— 

it can be used to prove any kind of statement whatsoever. The basic idea 
is to assume that the statement we want to prove is false, and then show 
that this assumption leads to nonsense. We are then led to conclude that 
we were wrong to assume the statement was false, so the statement must 
be true. As an example, consider the following proposition and its proof. 


Proposition If a,b«¢Z, then a?— 4b #2. 


Proof. Suppose this proposition is false. 

This conditional statement being false means there exist numbers a and b 
for which a,b € Z is true but a? — 4b #2 is false. 

Thus there exist integers a,b € Z for which | a? — 4b = 2. 
From this equation we get a? = 4b +2 = 2(2b+1), so a? is even. 

Since a? is even, it follows that a is even, so a = 2c for some integer c. 
Now plug a = 2c back into the boxed equation a? — 4b = 2. 

We get (2c)? -—4b = 2, so 4c? 4b =2. Dividing by 2, we get 2c?-2b=1. 
Therefore 1 =2(c?-— 6), and since c? —b € Z, it follows that 1 is even. 

Since we know 1 is not even, something went wrong. 

But all the logic after the first line of the proof is correct, so it must be 
that the first line was incorrect. In other words, we were wrong to assume 
the proposition was false. Thus the proposition is true. a 


You may be a bit suspicious of this line of reasoning, but in the next 
section we will see that it is logically sound. For now, notice that at 
the end of the proof we deduced that 1 is even, which conflicts with our 
knowledge that 1 is odd. In essence, we have obtained the statement 
(1 is odd) ~ (1 is odd), which has the form Ca ~C. Notice that no matter 
what statement C is, and whether or not it is true, the statement CA ~C 
must be false. A statement—like this one—that cannot be true is called a 
contradiction. Contradictions play a key role in our new technique. 
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6.1 Proving Statements with Contradiction 


Let’s now see why the proof on the previous page is logically valid. In 
that proof we needed to show that a statement P : (a,b € Z) > (a? — 4b # 2) 
was true. The proof began with the assumption that P was false, that is 
that ~ P was true, and from this we deduced Ca ~ C. In other words we 
proved that ~ P being true forces CA ~C to be true, and this means that 
we proved that the conditional statement (~ P)=>(C A ~C) is true. To see 
that this is the same as proving P is true, look at the following truth table 
for (~ P)>(CA~C). Notice that the columns for P and (~ P)>(Ca~C) 
are exactly the same, so P is logically equivalent to (~ P)>(C A ~C). 


P\|C]~P | Ca~C](~P)=>(CA~C) 
T|T|F F T 
T|F\F F T 
F|T | T F F 
F\/F | T F F 


Therefore to prove a statement P, it suffices to instead prove the conditional 
statement (~ P)=>(C A~C). This can be done with direct proof: Assume 
~P and deduce CA ~C. Here is the outline: 


Outline for Proof by Contradiction 


Proposition P. 


Proof. Suppose ~ P. 


Therefore CA~C. B 


One slightly unsettling feature of this method is that we may not know 
at the beginning of the proof what the statement C is going to be. In 
doing the scratch work for the proof, you assume that ~ P is true, then 
deduce new statements until you have deduced some statement C and its 
negation ~C. 

If this method seems confusing, look at it this way. In the first line of 
the proof we suppose ~ P is true, that is we assume P is false. But if P is 
really true then this contradicts our assumption that P is false. But we 
haven’t yet proved P to be true, so the contradiction is not obvious. We 
use logic and reasoning to transform the non-obvious contradiction ~ P to 
an obvious contradiction Ca ~C. 
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The idea of proof by contradiction is quite ancient, and goes back at 
least as far as the Pythagoreans, who used it to prove that certain numbers 
are irrational. Our next example follows their logic to prove that V2 is 
irrational. Recall that a number is rational if it equals a fraction of two 
integers, and it is irrational if it cannot be expressed as a fraction of two 
integers. Here is the exact definition. 


Definition 6.1 A real number x is rational if x = for some a,be Z. 
Also, x is irrational if it is not rational, that is if x 4 ¢ for every a,be Z. 


We are now ready to use contradiction to prove that V2 is irrational. 
According to the outline, the first line of the proof should be “Suppose that 
it is not true that V2 is irrational.” But it is helpful (though not mandatory) 
to tip our reader off to the fact that we are using proof by contradiction. 
One standard way of doing this is to make the first line “Suppose for the 
sake of contradiction that it is not true that V2 is irrational." 


Proposition The number v2 is irrational. 


Proof. Suppose for the sake of contradiction that it is not true that V2 is 
irrational. Then v2 is rational, so there are integers a and b for which 


Joe, (6.1) 
b 
Let this fraction be fully reduced; in particular, this means that a and b are 
not both even. (If they were both even, the fraction could be further reduced 
by factoring 2’s from the numerator and denominator and canceling.) 
Squaring both sides of Equation 6.1 gives 2= a and therefore 


a? = 207. (6.2) 


From this it follows that a? is even. But we proved earlier (Exercise 1 on 
page 108) that a? being even implies a is even. Thus, as we know that a 
and 0b are not both even, it follows that b is odd. Now, since a is even there 
is an integer c for which a = 2c. Plugging this value for a into Equation 6.2, 
we get (2c)? = 2b”, so 4c? = 267, and hence b? = 2c”. This means 0? is even, 
so b is even also. But previously we deduced that b is odd. Thus we have 
the contradiction b is even and b is odd. a 


To appreciate the power of proof by contradiction, imagine trying to 
prove that V2 is irrational without it. Where would we begin? What would 
be our initial assumption? There are no clear answers to these questions. 
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Proof by contradiction gives us a starting point: Assume v2 is rational, 
and work from there. 

In the above proof we got the contradiction (b is even) A ~(b is even) 
which has the form CA ~C. In general, your contradiction need not 
necessarily be of this form. Any statement that is clearly false is sufficient. 
For example 2 4 2 would be a fine contradiction, as would be 4|2, provided 
that you could deduce them. 

Here is another ancient example, dating back at least as far as Euclid: 


Proposition There are infinitely many prime numbers. 


Proof. For the sake of contradiction, suppose there are only finitely many 
prime numbers. Then we can list all the prime numbers as pj, p2,p3,...Pn; 
where p, = 2, p2=3,p3=5,p4=7 and so on. Thus p, is the nth and largest 
prime number. Now consider the number a = (p1p2p3-::pn)+1, that is, a is 
the product of all prime numbers, plus 1. Now a, like any natural number, 
has at least one prime divisor, and that means p,; |a for at least one of our 
n prime numbers p;. Thus there is an integer c for which a =cp;, which 
is to say 


(p1P2P3°**Pk-1PkPk+1°**Pn)+1l=cpp. 


Dividing both sides of this by pz; gives us 


1 
(P1P2P3°**PR-1Pk+1°**Pn) + — =6, 
Pk 
so ; 
— =C-(p1p2p3°** Pk-1Pk+1°*' Pn). 
Pk 
The expression on the right is an integer, while the expression on the left 
is not an integer. This is a contradiction. a 


Proof by contradiction often works well in proving statements of the 
form Vx,P(x). The reason is that the proof set-up involves assuming 
~ Vx,P(x), which as we know from Section 2.10 is equivalent to 4x,~ P(x). 
This gives us a specific x for which ~ P(x) is true, and often that is enough 
to produce a contradiction. Here is an example: 


Proposition For every real number x €[0,7/2], we have sinx+cosx => 1. 
Proof. Suppose for the sake of contradiction that this is not true. 


Then there exists an x€[0,z/2] for which sinx + cosx < 1. 
Since x € [0,2/2], neither sinx nor cosx is negative, so 0<sinx+cosx <1. 
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Thus 0? <(sinx +cosx)? < 12, which gives 0? < sin? x + 2sinxcosx+cos? x < 1?. 
As sin?.x+cos?x = 1, this becomes 0 < 1+2sinxcosx <1, so 1+2sinxcosx <1. 
Subtracting 1 from both sides gives 2sinxcosx <0. 

But this contradicts the fact that neither sinx nor cosx is negative. a 


6.2 Proving Conditional Statements by Contradiction 


Since the previous two chapters dealt exclusively with proving conditional 
statements, we now formalize the procedure in which contradiction is used 
to prove a conditional statement. Suppose we want to prove a proposition 
of the following form. 


Proposition If P, then Q. 


Thus we need to prove that P>Q is a true statement. Proof by 
contradiction begins with the assumption that ~ (P > Q) is true, that is, 
that P>Q is false. But we know that P > Q being false means that it is 
possible that P can be true while Q is false. Thus the first step in the 
proof is to assume P and ~@Q. Here is an outline: 


Outline for Proving a Conditional 
Statement with Contradiction 


Proposition If P, then Q. 


Proof. Suppose P and ~Q. 


Therefore CA~C. |_| 


To illustrate this new technique, we revisit a familiar result: If a? is 
even, then a is even. According to the outline, the first line of the proof 
should be “For the sake of contradiction, suppose a? is even and a is not 
even.” 


Proposition Suppose aé Z. If a? is even, then a is even. 


2 is even and a is not even. 


Proof. For the sake of contradiction, suppose a 
Then a? is even, and a is odd. 

Since a is odd, there is an integer c for which a = 2c +1. 
Then a? =(2c +1)? = 4c? + 4c + 1 = 2(2c? + 2c)+1, so a? is odd. 


Thus a? is even and a? is not even, a contradiction. | 
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Here is another example. 


Proposition Ifa,b¢Z and a=2, then a{b or a{(b+1). 


Proof. Suppose for the sake of contradiction there exist a,b € Z with a= 2, 
and for which it is not true that a{b or a{(b+1). 

By DeMorgan’s law, we have a|b and a|(b+1). 

The definition of divisibility says there are c,d € Z with b =ac and b6+1=ad. 
Subtracting one equation from the other gives ad —ac=1, so a(d-c)=1. 
Since a is positive, d—c is also positive (otherwise a(d—c) would be negative). 
Then d-c is a positive integer and a(d-c)=1, soa=1/(d-c)<2. 

Thus we have a=2 and a <2, a contradiction. a 


6.3 Combining Techniques 


Often, especially in more complex proofs, several proof techniques are 
combined within a single proof. For example, in proving a conditional 
statement P > Q, we might begin with direct proof and thus assume P to 
be true with the aim of ultimately showing Q is true. But the truth of Q 
might hinge on the truth of some other statement R which—together with 
P—would imply Q@. We would then need to prove R, and we would use 
whichever proof technique seems most appropriate. This can lead to “proofs 
inside of proofs.” Consider the following result. The overall approach is 
direct, but inside the direct proof is a separate proof by contradiction. 


Proposition Every non-zero rational number can be expressed as a 
product of two irrational numbers. 


Proof. This proposition can be reworded as follows: If r is a non-zero 
rational number, then r is a product of two irrational numbers. In what 
follows, we prove this with direct proof. 

Suppose r is a non-zero rational number. Then r= ¢ for integers a 
and b. Also, r can be written as a product of two numbers as follows: 


r= v2 


We know v2 is irrational, so to complete the proof we must show r/V2 is 
also irrational. 

To show this, assume for the sake of contradiction that r/V/2 is rational. 
This means 
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for integers c and d, so 


Voor, 
c 
But we know r=a/b, which combines with the above equation to give 
d ad ad 
2=r—=--=—. 
c bce be 


This means v2 is rational, which is a contradiction because we know it is 
irrational. Therefore r/V2 is irrational. 
Consequently r = /2-r/V2 is a product of two irrational numbers. 


For another example of a proof-within-a-proof, try Exercise 5 at the 
end of this chapter (or see its solution). Exercise 5 asks you to prove that 
V3 is irrational. This turns out to be slightly trickier than proving that 
V2 is irrational. 


6.4 Some Words of Advice 


Despite the power of proof by contradiction, it’s best to use it only when the 
direct and contrapositive approaches do not seem to work. The reason for 
this is that a proof by contradiction can often have hidden in it a simpler 
contrapositive proof, and if this is the case it’s better to go with the simpler 
approach. Consider the following example. 


Proposition Suppose aé Z. If a?-2a+7 is even, then a is odd. 


Proof. To the contrary, suppose a?—2a+7 is even and a is not odd. 

That is, suppose a? —2a+7 is even and a is even. 

Since a is even, there is an integer c for which a = 2c. 

Then a? - 2a +7 = (2c)* —2(2c)+ 7 = 2(2c? —2c + 3)+1, so a2 -2a+7 is odd. 
Thus a?—2a+7 is both even and odd, a contradiction. a 


Though there is nothing really wrong with this proof, notice that part 
of it assumes a is not odd and deduces that a? -2a+7 is not even. That is 
the contrapositive approach! Thus it would be more efficient to proceed as 
follows, using contrapositive proof. 


Proposition Suppose aé« Z. If a?-2a+7 is even, then a is odd. 
Proof. (Contrapositive) Suppose a is not odd. 
Then a is even, so there is an integer c for which a = 2c. 


Then a? —-2a+7 = (2c)? —2(2c) + 7 = 2(2c? —-2c + 3)+1, so a?-2a+7 is odd. 
Thus a? -2a+7 is not even. | 
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Exercises for Chapter 6 


A. Use the method of proof by contradiction to prove the following statements. 
(In each case, you should also think about how a direct or contrapositive proof 
would work. You will find in most cases that proof by contradiction is easier.) 


1. Suppose ne Z. If n is odd, then n? is odd. 

2. Suppose ne Z. If n? is odd, then n is odd. 

3. Prove that 2 is irrational. 

4. Prove that V6 is irrational. 

5. Prove that V3 is irrational. 

6. If a,be€ Z, then a?-4b-240. 

7. Ifa,beZ, then a2-4b-3#0. 

8. Suppose a,b,cé Z. If a2+b6? =c?, then a or b is even. 

9. Suppose a,beR. If a is rational and abd is irrational, then b is irrational. 
10. There exist no integers a and 6 for which 21a +306 =1. 
11. There exist no integers a and 6 for which 18a + 6b = 1. 


12. For every positive x € Q, there is a positive ye Q for which y<x. 
13. For every x€[z/2,7], sinx—cosx > 1. 

14. If A and B are sets, then An(B-A)=@. 

15. If be Z and b{k for every k EN, then b=0. 

16. If a and 6 are positive real numbers, then a+b >2Vab. 

17. For every ne€ Z, 4{ (n? +2). 

18. Suppose a,b Z. If 4|(a? +7), then a and b are not both odd. 


B. Prove the following statements using any method from Chapters 4, 5 or 6. 


19. The product of any five consecutive integers is divisible by 120. (For 
example, the product of 3,4,5,6 and 7 is 2520, and 2520 = 120-21.) 

20. We say that a point P = (x,y) in R? is rational if both x and y are rational. 
More precisely, P is rational if P =(x,y)¢ Q?. An equation F(x, y)=0 is said 
to have a rational point if there exists xo, yo € Q such that F(xo, yo) = 0. For 
example, the curve x7 +y*-—1=0 has rational point (xo, yo) = (1,0). Show that 
the curve x? + y?—3=0 has no rational points. 

21. Exercise 20 (above) involved showing that there are no rational points on 
the curve x? + y?—3=0. Use this fact to show that V3 is irrational. 

22. Explain why x? +y?-—3=0 not having any rational solutions (Exercise 20) 
implies x? + y?-3* =0 has no rational solutions for k an odd, positive integer. 

23. Use the above result to prove that 3 is irrational for all odd, positive k. 

24, The number log, 3 is irrational. 
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More on Proof 


CHAPTER 7 


Proving Non-Conditional Statements 


he last three chapters introduced three major proof techniques: direct, 

contrapositive and contradiction. These three techniques are used to 
prove statements of the form “If P, then Q.” As we know, most theorems 
and propositions have this conditional form, or they can be reworded to 
have this form. Thus the three main techniques are quite important. But 
some theorems and propositions cannot be put into conditional form. For 
example, some theorems have form “P if and only if Q.” Such theorems 
are biconditional statements, not conditional statements. In this chapter 
we examine ways to prove them. In addition to learning how to prove 
if-and-only-if theorems, we will also look at two other types of theorems. 


7.1 If-and-Only-If Proof 


Some propositions have the form 
P if and only if Q. 


We know from Section 2.4 that this statement asserts that both of the 
following conditional statements are true: 


If P, then Q. 
If Q, then P. 


So to prove “P if and only if Q,” we must prove two conditional statements. 
Recall from Section 2.4 that Q = P is called the converse of P=>Q. Thus 
we need to prove both P > Q and its converse. Since these are both condi- 
tional statements we may prove them with either direct, contrapositive or 
contradiction proof. Here is an outline: 


Outline for If-and-Only-If Proof 
Proposition P if and only if Q. 


Proof. 
[Prove P>Q using direct, contrapositive or contradiction proof. ] 
[Prove Q > P using direct, contrapositive or contradiction proof.] m 
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Let’s start with a very simple example. You already know that an 
integer n is odd if and only if n? is odd, but let’s prove it anyway, just 
to illustrate the outline. In this example we prove (n is odd)=>(n? is odd) 
using direct proof and (n? is odd)=>(n is odd) using contrapositive proof. 


Proposition The integer n is odd if and only if n? is odd. 


Proof. First we show that n being odd implies that n? is odd. Suppose n 
is odd. Then, by definition of an odd number, n = 2a +1 for some integer a. 
Thus n? = (2a +1)? = 4a7+4a+1= 2(2a7+2a)+1. This expresses n” as twice 
an integer, plus 1, so n? is odd. 

Conversely, we need to prove that n? being odd implies that n is odd. 
We use contrapositive proof. Suppose n is not odd. Then n is even, so 
n = 2a for some integer a (by definition of an even number). Thus n? = 
(2a)? = 2(2a), so n? is even because it’s twice an integer. Thus n? is not 
odd. We’ve now proved that if n is not odd, then n? is not odd, and this is 
a contrapositive proof that if n? is odd then n is odd. a 


In proving “P if and only if Q,” you should begin a new paragraph 
when starting the proof of @ > P. Since this is the converse of P > Q, it’s 
a good idea to begin the paragraph with the word “Conversely” (as we did 
above) to remind the reader that you’ve finished the first part of the proof 
and are moving on to the second. Likewise, it’s a good idea to remind the 
reader of exactly what statement that paragraph is proving. 

The next example uses direct proof in both parts of the proof. 


Proposition Suppose a and bd are integers. Then a =b (mod 6) if and 
only if a=b (mod 2) and a=b (mod 3). 


Proof. First we prove that if a =b (mod 6), then a=b (mod 2) and a=b 
(mod 3). Suppose a = b (mod 6). This means 6|(a—b), so there is an integer 
n for which 

a-—b=6n. 


From this we get a—b = 2(3n), which implies 2|(a—b), soa=b6 (mod 2). But 
we also get a—b = 3(2n), which implies 3|(a—b), so a=b (mod 3). Therefore 
a=b (mod 2) and a=b (mod 3). 

Conversely, suppose a = b (mod 2) and a= b (mod 3). Since a = b (mod 2) 
we get 2|(a—b), so there is an integer k for which a—b = 2k. Therefore a—b 
is even. Also, from a=b (mod 3) we get 3|(a—b), so there is an integer @ 
for which 

a-b=388. 
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But since we know a~— 0 is even, it follows that ? must be even also, for 
if it were odd then a-—b = 3¢ would be odd (because a-b would be the 
product of two odd integers). Hence ¢ = 2m for some integer m. Thus 
a—-b=30=3-:2m=6m. This means 6|(a—b), so a=b (mod 6). | 


Since if-and-only-if proofs simply combine methods with which we are 
already familiar, we will not do any further examples in this section. 
However, it is of utmost importance that you practice your skill on some 
of this chapter’s exercises. 


7.2 Equivalent Statements 


In other courses you will sometimes encounter a certain kind of theorem 
that is neither a conditional nor a biconditional statement. Instead, it 
asserts that a list of statements is “equivalent.” You saw this (or will see 
it) in your linear algebra textbook, which featured the following theorem: 


Theorem Suppose A is an nxn matrix. The following statements are 
equivalent: 


(a) The matrix A is invertible. 

(b) The equation Ax=b has a unique solution for every be R”. 
(c) The equation Ax=0 has only the trivial solution. 

(d) The reduced row echelon form of A is [,. 

(e) det(A) 40. 

(f) The matrix A does not have 0 as an eigenvector. 


When a theorem asserts that a list of statements is “equivalent,” it is 
asserting that either the statements are all true, or they are all false. 
Thus the above theorem tells us that whenever we are dealing with a 
particular nxn matrix A, then either the statements (a) through (f) are all 
true for A, or statements (a) through (f) are all false for A. For example, if 
we happen to know that det(A) 4 0, the theorem assures us that in addition 
to statement (e) being true, all the statements (a) through (f) are true. On 
the other hand, if it happens that det(A) = 0, the theorem tells us that all 
statements (a) through (f) are false. In this way, the theorem multiplies 
our knowledge of A by a factor of six. Obviously that can be very useful. 

What method would we use to prove such a theorem? In a certain 
sense, the above theorem is like an if-and-only-if theorem. An if-and-only-if 
theorem of form P © Q asserts that P and Q are either both true or both 
false, that is, that P and Q are equivalent. To prove P = Q we prove P>Q 
followed by Q => P, essentially making a “cycle” of implications from P to Q 
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and back to P. Similarly, one approach to proving the theorem about the 
nxn matrix would be to prove the conditional statement (a) > (b), then 
(b) > (c), then (c) > (d), then (d) > (e), then (e) > (f) and finally (f) > (a). 
This pattern is illustrated below. 


(a) = () => (ec) 


i y 
(ff) = () — @d) 


Notice that if these six implications have been proved, then it really does 
follow that the statements (a) through (f) are either all true or all false. If 
one of them is true, then the circular chain of implications forces them 
all to be true. On the other hand, if one of them (say (c)) is false, the fact 
that (b) > (c) is true forces (b) to be false. This combined with the truth of 
(a) > (b) makes (a) false, and so on counterclockwise around the circle. 

Thus to prove that n statements are equivalent, it suffices to prove n 
conditional statements showing each statement implies another, in circular 
pattern. But it is not necessary that the pattern be circular. The following 
schemes would also do the job: 


(a) => (b+) => (oe) 


ft 
(f) <= (e) —> (d) 


(a) —> (b+) — (ec) 


0 
(ff) —> (e) —> (d) 


But a circular pattern yields the fewest conditional statements that 
must be proved. Whatever the pattern, each conditional statement can be 
proved with either direct, contrapositive or contradiction proof. 

Though we shall not do any of these proofs in this text, you are sure to 
encounter them in subsequent courses. 


7.3 Existence Proofs; Existence and Uniqueness Proofs 


Up until this point, we have dealt with proving conditional statements 
or with statements that can be expressed with two or more conditional 
statements. Generally, these conditional statements have form P(x) > Q(x). 
(Possibly with more than one variable.) We saw in Section 2.8 that this 
can be interpreted as a universally quantified statement V x, P(x) > Q(x). 
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Thus, conditional statements are universally quantified statements, so 
in proving a conditional statement—whether we use direct, contrapositive 
or contradiction proof—we are really proving a universally quantified 
statement. 

But how would we prove an existentially quantified statement? What 
technique would we employ to prove a theorem of the following form? 


dx,R(x) 


This statement asserts that there exists some specific object x for which 
R(x) is true. To prove 4x,R(x) is true, all we would have to do is find and 
display an example of a specific x that makes R(x) true. 

Though most theorems and propositions are conditional (or if-and- 
only-if) statements, a few have the form 4x,R(x). Such statements are 
called existence statements, and theorems that have this form are called 
existence theorems. To prove an existence theorem, all you have to do 
is provide a particular example that shows it is true. This is often quite 
simple. (But not always!) Here are some examples: 


Proposition There exists an even prime number. 


Proof. Observe that 2 is an even prime number. a 


Admittedly, this last proposition was a bit of an oversimplification. The 
next one is slightly more challenging. 


Proposition There exists an integer that can be expressed as the sum 
of two perfect cubes in two different ways. 


Proof. Consider the number 1729. Note that 1° + 12? = 1729 and 9? + 10° = 
1729. Thus the number 1729 can be expressed as the sum of two perfect 
cubes in two different ways. a 


Sometimes in the proof of an existence statement, a little verification is 
needed to show that the example really does work. For example, the above 
proof would be incomplete if we just asserted that 1729 can be written as 
a sum of two cubes in two ways without showing how this is possible. 


WARNING: Although an example suffices to prove an existence statement, 
a single example does not prove a conditional statement. 
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Often an existence statement will be embedded inside of a conditional 
statement. Consider the following. (Recall the definition of gcd on page 88.) 


If a,b EN, then there exist integers k and ¢ for which gcd(a,b) =ak+bé. 


This is a conditional statement that has the form 
a,beN => Fk,leZ, ged(a,b)=ak+bé. 


To prove it with direct proof, we would first assume that a,b<«N, then 
prove the existence statement 4k,¢€ Z, gcd(a,b)=ak+b¢. That is, we 
would produce two integers k and ¢ (which depend on a and b) for which 
gcd(a,b)=ak+bé. Let’s carry out this plan. 


Proposition If a,b¢N, then there exist integers k and ¢ for which 
gcd(a,b)=ak+ dé. 


Proof. (Direct) Suppose a,b € N. Consider the set A = {ax+by:x,yeZ}. 
This set contains both positive and negative integers, as well as 0. (Reason: 
Let y=0 and let x range over all integers. Then ax+ by =ax ranges over 
all multiples of a, both positive, negative and zero.) Let d be the smallest 
positive element of A. Then, because d is in A, it must have the form 
d=ak+0bé for some specific k, @ € Z. 

To finish the proof we will show d = gcd(a,b). We will first argue that 
d is a common divisor of a and b, and then that it is the greatest common 
divisor. 

To see that d|a, use the division algorithm (page 28) to write a=qd+r 
for integers gq and r with 0<r<d. The equation a=qd+r yields 


r a-qd 
= a-q(ak+bd@) 


a(1-qk)+b(-q@). 


Therefore r has form r=ax+by, so it belongs to A. But 0<r<d andd 
is the smallest positive number in A, so r can’t be positive; hence r =0. 
Updating our equation a =qd+r, we get a=qd, so d|a. Repeating this 
argument with b= qd+r shows d|b. Thus d is indeed a common divisor 
of a and b. It remains to show that it is the greatest common divisor. 

As gcd(a,b) divides a and b, we have a=gcd(a,b)-m and b = gcd(a,b)-n 
for m,neéZ. Then d =ak+bé = gced(a,b)-mk+gcd(a,b)-né = ged(a,b)(mk+né), 
so d is a multiple of gcd(a,b). Hence d = gcd(a,b). But d can’t be a larger 
common divisor of a and 6 than gced(a,b), so d = gcd(a,b). a 
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We conclude this section with a discussion of so-called wniqueness 
proofs. Some existence statements have form “There is a unique x for 
which P(x).” Such a statement asserts that there is exactly one example x 
for which P(x) is true. To prove it, you must produce an example x=d for 
which P(d) is true, and you must show that d is the only such example. 
The next proposition illustrates this. In essence, it asserts that the set 
{ax+by:x,y€Z} consists precisely of all the multiples of gcd(a, bd). 


Proposition Suppose a,b¢N. Then there exists a unique d €N for which: 
An integer m is a multiple of d if and only if m=ax+by for some x, ye Z. 


Proof. Suppose a,beN. Let d= gcd(a,b). We now show that an integer m 
is a multiple of d if and only if m=ax+by for some x,ye Z. Let m=dn be 
a multiple of d. By the previous proposition, there are integers k and ¢ for 
which d=ak+b@. Then m=dn=(ak+b)n =alkn)+b(€n), so m=ax+by 
for integers x=kn and y=0n. 

Conversely, suppose m=ax+by for some x,ye€Z. Since d = gcd(a, bd) is 
a divisor of both a and b, we have a=dc and b=de for some c,ee€ Z. Then 
m=ax+by=dcx+dey=d(cx+ey), and this is a multiple of d. 

We have now shown that there is a natural number d with the property 
that m is a multiple of d if and only if m =ax+by for some x,ye Z. It 
remains to show that d is the wnique such natural number. To do this, 
suppose d’ is any natural number with the property that d has: 


m is a multiple of d’ <> m=ax+by for some x,ye Z. (7.1) 


We next argue that d' =d; that is, d is the unique natural number with 
the stated property. Because of (7.1), m=a-1+6-0=a is a multiple of d’. 
Likewise m=a-0+b-1=6 is a multiple of d'. Hence a and b are both 
multiples of d’, so d' is a common divisor of a and 6, and therefore 


d' <gcd(a,b) =d. 


But also, by (7.1), the multiple m =d'-1=d' of d’ can be expressed as 
d'=ax+by for some x,ye Z. As noted in the second paragraph of the proof, 
a=dc and b=de for some c,ee Z. Thus d'=ax+by=dcex+dey=d(cx+ey), 
so d’ is a multiple d. As d’ and d are both positive, it follows that 


d<d'. 


We’ve now shown that d’' <d and d <d’',sod=d'. The proofis complete. 
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7.4 Constructive Versus Non-Constructive Proofs 


Existence proofs fall into two categories: constructive and non-constructive. 
Constructive proofs display an explicit example that proves the theorem; 
non-constructive proofs prove an example exists without actually giving it. 
We illustrate the difference with two proofs of the same fact: There exist 
irrational numbers x and y (possibly equal) for which x” is rational. 


Proposition There exist irrational numbers x,y for which x” is rational. 


Proof. Let x = vo" and y= V2. We know y is irrational, but it is not clear 
whether x is rational or irrational. On one hand, if x is irrational, then 
we have an irrational number to an irrational power that is rational: 


V2 
we [v2""] eee 


On the other hand, if x is rational, then y” = Var” =x is rational. Either 
way, we have a irrational number to an irrational power that is rational. 


The above is a classic example of a non-constructive proof. It shows 
that there exist irrational numbers x and y for which x” is rational without 
actually producing (or constructing) an example. It convinces us that one 


of (20 or V2"" is an irrational number to an irrational power that 
is rational, but it does not say which one is the correct example. It thus 
proves that an example exists without explicitly stating one. 

Next comes a constructive proof of this statement, one that produces 
(or constructs) two explicit irrational numbers x,y for which x” is rational. 


Proposition There exist irrational numbers x,y for which x” is rational. 


Proof. Let x= V2 and y=log,9. Then 


es lov. 32 2los>3 92) logs 3 
yr aa/g ve - /2 O$9 = (v2) 2 — glog.3 _ 9 


As 3 is rational, we have shown that x” = 3 is rational. 

We know that «x = V2 is irrational. The proof will be complete if we 
can show that y =log,9 is irrational. Suppose for the sake of contradiction 
that log9 is rational, so there are integers a and 6 for which ¢ = logs9. 
This means 2” = 9, so (2”°)° = 9°, which reduces to 27 = 9°. But 2% is even, 
while 9° is odd (because it is the product of the odd number 9 with itself 
b times). This is a contradiction; the proof is complete. a 
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This existence proof has inside of it a separate proof (by contradiction) 
that log.9 is irrational. Such combinations of proof techniques are, of 
course, typical. 

Be alert to constructive and non-constructive proofs as you read proofs 
in other books and articles, as well as to the possibility of crafting such 
proofs of your own. 


Exercises for Chapter 7 


Prove the following statements. These exercises are cumulative, covering all 
techniques addressed in Chapters 4—7. 

. Suppose xe Z. Then x is even if and only if 3x +5 is odd. 

. Suppose xe Z. Then «x is odd if and only if 3x+6 is odd. 


24a is even if and only if a is even. 


Given an integer a, then a? +a 
Given an integer a, then a2+4a+5 is odd if and only if a is even. 

. An integer a is odd if and only if a° is odd. 

. Suppose x,yER. Then x? +x?y= y?2+xy if and only if y=x? or y=—x. 


. Suppose x,yeR. Then (x+y)? =x?+ y? if and only if x=0 or y=0. 


SNAP APR wD 


. Suppose a,be Z. Prove that a=b6 (mod 10) if and only if a=b (mod 2) anda=b 
(mod 5). 


9. Suppose ae Z. Prove that 14|a if and only if 7|a and 2|a. 
10. If ae Z, then a® =a (mod 38). 
11. Suppose a,b€ Z. Prove that (a—3)b? is even if and only if a is odd or b is even. 
12. There exist a positive real number x for which x? < \/x. 
13. Suppose a,be Z. If a+b is odd, then a? +b? is odd. 
14, Suppose ae Z. Then a?\a if and only if a€ {-1,0,1}. 
15. Suppose a,b« Z. Prove that a+6 is even if and only if a and b have the same 
parity. 
16. Suppose a,b€ Z. If ab is odd, then a? +b? is even. 
17. There is a prime number between 90 and 100. 
18. There is a set X for which Ne X and NcX. 
19. If nEN, then 2°+2!+27+234944...49% = Q"t+1_], 
20. There exists an n€N for which 11|(2”-1). 
21. Every real solution of x? +.x+3=0 is irrational. 
22. If ne Z, then 4|n? or 4|(n?-1). 
23. Suppose a,b and c are integers. If a|b and a|(b?—c), then a|c. 
24. If ae Z, then 4{(a?-3). 
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25. 


26. 
27. 
28. 


29. 


30. 


31. 
32. 
33. 
34. 


35. 
36. 


If p >1 is an integer and n{p for each integer n for which 2<n< ,/p, then p is 
prime. 


The product of any n consecutive positive integers is divisible by n!. 
Suppose a,b¢ Z. If a?+6? is a perfect square, then a and b are not both odd. 


Prove the division algorithm: If a,b e«N, there exist wnique integers q,r for 
which a=bq+r, and 0<r<b. (A proof of existence is given in Section 1.9, but 
uniqueness needs to be established too.) 


If a| bc and ged(a,b)=1, then a|c. 
(Suggestion: Use the proposition on page 124.) 


Suppose a,b,p €« Z and p is prime. Prove that if p|ab then p|a or p|b. 
(Suggestion: Use the proposition on page 124.) 


If ne Z, then ged(n,n+1)=1. 
If ne Z, then ged(n,n +2)e {1,2}. 
If ne Z, then ged(2n + 1,4n?+1)=1. 


If ged(a,c) = ged(b,c) = 1, then ged(ab,c) = 1. 
(Suggestion: Use the proposition on page 124.) 


Suppose a,beN. Then a = ged(a,d) if and only if a| b. 
Suppose a,beN. Then a =Ilcm(a,)d) if and only if b| a. 


CHAPTER 8 


Proofs Involving Sets 


tudents in their first advanced mathematics classes are often surprised 
by the extensive role that sets play and by the fact that most of the 
proofs they encounter are proofs about sets. Perhaps you’ve already seen 
such proofs in your linear algebra course, where a vector space was 
defined to be a set of objects (called vectors) that obey certain properties. 
Your text proved many things about vector spaces, such as the fact that 
the intersection of two vector spaces is also a vector space, and the proofs 
used ideas from set theory. As you go deeper into mathematics, you will 
encounter more and more ideas, theorems and proofs that involve sets. 
The purpose of this chapter is to give you a foundation that will prepare 
you for this new outlook. 

We will discuss how to show that an object is an element of a set, how 
to prove one set is a subset of another and how to prove two sets are 
equal. As you read this chapter, you may need to occasionally refer back 
to Chapter 1 to refresh your memory. For your convenience, the main 
definitions from Chapter 1 are summarized below. If A and B are sets, 
then: 


AxB = {(x,y):x€A, ye B}, 

AUB = {x:(xeA)v(xeB), 

ANB = {x:(xeE A)A(ceB), 

A-B = {x:(xeA)A(x¢ Bf, 
A = U-A. 


Recall that A <B means that every element of A is also an element of B. 


8.1 How to Prove acA 


We will begin with a review of set-builder notation, and then review how 
to show that a given object a is an element of some set A. 
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Generally, a set A will be expressed in set-builder notation A = {x : P(x)}, 
where P(x) is some statement (or open sentence) about x. The set A is 
understood to have as elements all those things x for which P(x) is true. 
For example, 


{x:x is an odd integer} = {...,-5,-3,-1,1,3,5,...}. 


A common variation of this notation is to express a set as A= {xe S:P(x)}. 
Here it is understood that A consists of all elements x of the (predetermined) 
set S for which P(x) is true. Keep in mind that, depending on context, x 
could be any kind of object (integer, ordered pair, set, function, etc.). There 
is also nothing special about the particular variable x; any reasonable 
symbol x, y, k, etc., would do. Some examples follow. 


jneZ:nisodd} = {...,-5,-3,-1,1,3,5,...} 
{ceN:6|x} = {6,12,18,24,30,...} 
{(a,b)€ZxZ:b=a+5} = {...,(-2,3),(-1,4),(0,5),(1,6),...} 
{X € A(Z):|X| = 1} {...,4—1}, {0}, {1}, {2}, {3}, {4}... } 


Now it should be clear how to prove that an object a belongs to a set 
{x :P(x)}. Since {x :P(x)} consists of all things x for which P(x) is true, to 
show that aé {x : P(x)} we just need to show that P(a) is true. Likewise, to 
show ae {xe S :P(x)}, we need to confirm that ae S and that P(a) is true. 
These ideas are summarized below. However, you should not memorize 
these methods, you should understand them. With contemplation and 
practice, using them becomes natural and intuitive. 


How to show ae {x: P(x)} How to show ae {x<S: P(x)} 


Show that P(a) is true. 1. Verify that ace S. 
2. Show that P(a) is true. 


Example 8.1 _Let’s investigate elements of A = {x:x¢N and 7|x}. This set 
has form A = {x:P(«x)} where P(x) is the open sentence (x € N)A(7|«). Thus 
21¢A because P(21) is true. Similarly, 7,14,28,35, etc., are all elements of 
A. But 8¢A (for example) because P(8) is false. Likewise —14¢ A because 
P(-14) is false. 


Example 8.2 Consider the set A = {X € A(N): |X| = 3}. We know that 
{4,13,45}€ A because {4, 13,45} « A(N) and |{4,13,45}|=3. Also {1,2,3}¢€A, 
{10,854,3} € A, etc. However {1,2,3,4}¢A because |{1,2,3,4}| #3. Further, 
{-1,2,3$¢A because {-1,2,3}¢ P(N). 
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Example 8.3 Consider the set B = {(x,y)¢ Zx Z:x = y(mod 5)}. Notice 
(8,23) € B because (8,23) € Zx Z and 8= 23 (mod 5). Likewise, (100,75) « B, 
(102,77) €B, etc., but (6,10) ¢B. 

Now suppose né Z and consider the ordered pair (4n + 3,9n —2). Does 
this ordered pair belong to B? To answer this, we first observe that 
(4n+3,9n-2)€ ZxZ. Next, we observe that (4n+3)—(9n-2) = —5n+5 = 5(1-n), 
so 5|((4n+3)—(9n-2)), which means (4n+3) =(9n—2) (mod 5). Therefore we 
have established that (4n + 3,9n —2) meets the requirements for belonging 
to B, so (4n+3,9n-—2)e€B for every ne Z. 


Example 8.4 This illustrates another common way of defining a set. 
Consider the set C = {8x?+2:xeZ}. Elements of this set consist of all the 
values 3x°+2 where x is an integer. Thus —22€ C because —22 = 3(-2)? +2. 
You can confirm —1¢C and 5€C, etc. Also 0¢C and 5¢ C, etc. 


8.2 How to Prove AcB 


In this course (and more importantly, beyond it) you will encounter many 
circumstances where it is necessary to prove that one set is a subset of an- 
other. This section explains how to do this. The methods we discuss should 
improve your skills in both writing your own proofs and in comprehending 
the proofs that you read. 

Recall (Definition 1.3) that if A and B are sets, then ACB means that 
every element of A is also an element of B. In other words, it means if 
ac A, then ae B. Therefore to prove that A <B, we just need to prove that 
the conditional statement 


“If ae A, then ae B” 


is true. This can be proved directly, by assuming a¢ A and deducing aé B. 
The contrapositive approach is another option: Assume a¢ B and deduce 
a¢A. Each of these two approaches is outlined below. 


How to Prove ACB How to Prove ACB 
(Direct approach) (Contrapositive approach) 
Proof. Suppose ac A. Proof. Suppose a¢B. 
Therefore ac B. Therefore a ¢ A. 
Thus ac€ A implies ac B, Thus a¢B implies a¢ A, 
so it follows that ACB. UH so it follows that ACB. UH 
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In practice, the direct approach usually results in the most straight- 
forward and easy proof, though occasionally the contrapositive is the 
most expedient. (You can even prove ACB by contradiction: Assume 
(ae A)A(a¢B), and deduce a contradiction.) The remainder of this section 
consists of examples with occasional commentary. Unless stated otherwise, 
we will use the direct approach in all proofs; pay special attention to how 
the above outline for the direct approach is used. 


Example 8.5 Prove that {xe Z:18|x}c{xeZ:6|x}. 


Proof. Suppose ae {x€ Z:18|x}. 
This means that ae Z and 18|a. 
By definition of divisibility, there is an integer c for which a = 18c. 
Consequently a = 6(3c), and from this we deduce that 6|a. 
Therefore a is one of the integers that 6 divides, so ae {xe Z:6|x}. 

We’ve shown aé {x€ Z:18|x} implies ae {ne Z:6|x}, so it follows that 
{x€Z:18|x}c{xeZ:6|x}. a 


Example 8.6 Prove that {xe Z:2|x}n{xeZ:9|xbc{xeZ:6|x}. 


Proof. Suppose ac€ {xe Z:2|x}n{xeZ:9|x}. 
By definition of intersection, this means ae {xe Z:2|x} and ae {xeEZ:9|x}. 
Since aeé {x € Z:2|x} we know 2|a, so a=2c for some ce Z. Thus a is even. 
Since a€ {xe Z:9|x} we know 9|a, so a=9d for some de Z. 
As a is even, a = 9d implies d is even. (Otherwise a = 9d would be odd.) 
Then d = 2e for some integer e, and we have a = 9d = 9(2e) = 6(8e). 
From a = 6(3e), we conclude 6|a, and this means ae {x€ Z:6|x}. 

We have shown that ae {xe Z:2|x}n{xeZ:9|x} implies ae {xe Z:6|x}, 
so it follows that {xe Z:2|x}n{xeEZ:9|x}c{xeZ:6|x}. a 


Example 8.7 Show {(x,y)¢ ZxZ:x = y (mod 6)} € {(x,y)€ZxZ:x = y(mod 3)}. 


Proof. Suppose (a,b) € {(x,y)€Z x Z:x = y(mod 6)}. 
This means (a,b)« Zx Z and a=b (mod 6). 
Consequently 6|(a—b), so a—b =6c for some integer c. 
It follows that a—b = 3(2c), and this means 3|(a—b), so a=b (mod 3). 
Thus (a,b) {(x,y)€Zx Z:x= y(mod 3)}. 

We’ve now seen that (a,b) € {(x,y)€ Zx Z:x = y(mod 6)} implies (a,b) € 
{(x,y)€Zx Z:x=y(mod 3)}, so it follows that {(x,y)¢Zx Z:x = y(mod 6)}¢ 
{(x,y)€ZxZ:x=y(mod 3)}. a 
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Some statements involving subsets are transparent enough that we 
often accept (and use) them without proof. For example, if A and B are any 
sets, then it’s very easy to confirm ANBCA. (Reason: Suppose xe ANB. 
Then x¢€ A and x¢€B by definition of intersection, so in particular x¢€ A. 
Thus x€ ANB implies xe A, so ANBCA.) Other statements of this nature 
include AC AUB and A—B¢CA, as well as conditional statements such as 
((ACB)A(BCC)) > (ASC) and (X <A) >(X CAUB). Our point of view in 
this text is that we do not need to prove such obvious statements unless we 
are explicitly asked to do so in an exercise. (Still, you should do some quick 
mental proofs to convince yourself that the above statements are true. If 
you don’t see that ANBCA is true but that AC ANB is not necessarily 
true, then you need to spend more time on this topic.) 

The next example will show that if A and B are sets, then Y(A)UA(B) 
Y(AUB). Before beginning our proof, let’s look at an example to see if 
this statement really makes sense. Suppose A = {1,2} and B = {2,3}. Then 


PALAB) = {9,{1h, {2h {1, 2k u{a, {at {3}, {2, ah} 
{D,{1}, {2}, {3}, {1,2}, {2,3}. 


Also A(AUB) = A(4{1,2,3}) = {, {1}, {2k {3}, {1,2}, {2,3}, {1,3}, {12,3}. Thus, 
even though A(A)U A(B) 4 A(AUB), it is true that A(A)U A(B) ¢ A(AUB) 
for this particular A and B. Now let’s prove W(A)U A(B)c A(AUB) no 
matter what sets A and B are. 


Example 8.8 Prove that if A and B are sets, then AW(A)UA(B) S A(AUB). 


Proof. Suppose X € A(A)U A(B). 
By definition of union, this means X € A(A) or X € A(B). 
Therefore X <A or X CB (by definition of power sets). We consider cases. 
Case 1. Suppose X cA. Then X CAUB, and this means X € A(A UB). 
Case 2. Suppose X <B. Then X CAUB, and this means X € P(A UB). 
(We do not need to consider the case where X <A and X CB because that 
is taken care of by either of cases 1 or 2.) The above cases show that 
X€ A(AvuB). 

Thus we’ve shown that X €« A(A)U A(B) implies X ¢ A(A UB), and this 
completes the proof that A(A)U A(B)c A(A UB). a 


In our next example, we prove a conditional statement. Direct proof is 
used, and in the process we use our new technique for showing ACB. 
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Example 8.9 Suppose A and B are sets. If A(A)c A(B), then ACB. 


Proof. We use direct proof. Assume A(A)¢ A(B). 

Based on this assumption, we must now show that ACB. 

To show ACB, suppose that ac A. 

Then the one-element set {a} is a subset of A, so {a} ¢ Y(A). 

But then, since A(A)¢c A(B), it follows that {a} « AB). 

This means that {a} ¢B, hence ac B. 

We’ve shown that ac A implies a€ B, so therefore ACB. a 


8.3 How to Prove A=B 


In proofs it is often necessary to show that two sets are equal. There is a 
standard way of doing this. Suppose we want to show A=B. If we show 
ACB, then every element of A is also in B, but there is still a possibility 
that B could have some elements that are not in A, so we can’t conclude 
A=B. But if in addition we also show BCA, then B can’t contain anything 
that is not in A, so A=B. This is the standard procedure for proving A = B: 
Prove both ACB and BCA. 


How to Prove A=B 


Proof. 
[Prove that ACB.] 
[Prove that BC A.] 


Therefore, since ACB and BCA, 
it follows that A=B. a 


Example 8.10 Prove that {n¢Z:35|n}={neZ:5|n}n{neZ:7\n}. 


Proof. First we show {n€ Z:35|n}c{neZ:5|n}n{neZ:7|n}. Suppose 
a€é {n€Z:35|n}. This means 35|a, so a = 35c for some ce Z. Thus a=5(7c) 
and a= 7(5c). From a =5(7c) it follows that 5|a, soae {ne Z:5|n}. From 
a =7(5c) it follows that 7|a, which means ae {ne Z:7|n}. As a belongs 
to both {ne Z:5|n} and {ne Z:7|n}, we get ae {neZ:5|n}n{neZ:7\|n}. 
Thus we’ve shown that {ne Z:35|n}c{neZ:5|nb}n{neZ:7\n}. 

Next we show {néZ:5|n}n{neZ:7|n}c {ne Z:35|n}. Suppose that 
aé {ne Z:5|n}n{neZ:7|n}. By definition of intersection, this means that 
ae {neZ:5\|n} and ae {neZ:7\n}. Therefore it follows that 5|a and 7\/a. 
By definition of divisibility, there are integers c and d witha = 5c and a=7d. 
Then a has both 5 and 7 as prime factors, so the prime factorization of a 
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must include factors of 5 and 7. Hence 5-7 = 35 divides a, soae {n€ Z:35|n}. 
We’ve now shown that {ne Z:5|n}n{neZ:7\|n} c{neZ:35|n}. 

At this point we’ve shown that {née Z:35|n}c{neZ:5|n}n{neZ:7\n} 
and {ne Z:5|n}n{neZ:7|n}<c {ne Z:35|n}, so we've proved {n € Z:35|n} = 
{neZ:5|n}n{neZ:7\n}. a 


You know from algebra that if c#0 and ac=bc, then a=b. The next 
example shows that an analogous statement holds for sets A,B and C. The 
example asks us to prove a conditional statement. We will prove it with 
direct proof. In carrying out the process of direct proof, we will have to 
use the new techniques from this section. 


Example 8.11 Suppose A, B, and C are sets, and C#¢@. Prove that if 
AxC=BxC, then A=B. 


Proof. Suppose A x C=BxC. We must now show A =B. 

First we will show ACB. Suppose ac A. Since C # g, there exists 
an element ce C. Thus, since ae A and ce C, we have (a,c)€ Ax C, by 
definition of the Cartesian product. But then, since A x C=BxC, it follows 
that (a,c)e Bx C. Again by definition of the Cartesian product, it follows 
that ae B. We have shown ac€ A implies ae B, so ACB. 

Next we show BCA. We use the same argument as above, with the 
roles of A and B reversed. Suppose ace B. Since C # @, there exists an 
element ce C. Thus, since ae B and ce C, we have (a,c)e Bx C. But then, 
since Bx C=AxC, we have (a,c)€ AxC. It follows that ace A. We have 
shown a¢€B implies ae A, so BCA. 

The previous two paragraphs have shown ACB and BCA, so A=B. In 
summary, we have shown that if Ax C=BxC, then A=B. This completes 
the proof. a 


Now we'll look at another way that set operations are similar to oper- 
ations on numbers. From algebra you are familiar with the distributive 
property a-(b+c)=a-b+a-c. Replace the numbers a,b,c with sets A,B,C, 
and replace - with x and + with n. We get Ax(BnNC)=(AxB)n(AxC),. 
This statement turns out to be true, as we now prove. 


Example 8.12 Given sets A, B, and C, prove Ax(BNC)=(AxB)n(AxC),. 


Proof. First we will show that A x(BnC)¢(A x B)n(A x C). 

Suppose (a,b)€ Ax (BNC). 

By definition of the Cartesian product, this means ae A and be BNC. 
By definition of intersection, it follows that be B and beEC. 


136 Proofs Involving Sets 


Thus, since ac A and b€B, it follows that (a,b)¢ A x B (by definition of x). 
Also, since ace A and b€C, it follows that (a,b)¢ A x C (by definition of x). 
Now we have (a,b)€ Ax B and (a,b)e Ax C, so (a,b)E(A x B)N(AxC). 
We’ve shown that (a,b)¢ A x (BNC) implies (a,b) € (A x B)N(A x C) so we 
have A x(BnNC)c(AxB)n(AxC). 

Next we will show that (A x B)n(AxC)GAx(BNC). 
Suppose (a,b) «(A x B)n(A x C). 
By definition of intersection, this means (a,b)¢ A xB and (a,b)EAxC. 
By definition of the Cartesian product, (a,b)<¢ Ax B means ac A and beB. 
By definition of the Cartesian product, (a,b)¢ Ax C means ace A and DEC. 
We now have 0€B and b€C, so bE BNC, by definition of intersection. 
Thus we’ve deduced that ae A and be BNC, so (a,b)E Ax (BNC). 
In summary, we’ve shown that (a,b) € (Ax B)n(AxC) implies (a,b) € Ax(BNC) 
so we have (Ax B)N(AxC)GA x(BNC). 

The previous two paragraphs show that A x(BnC)<¢(AxB)n(AxC) and 
(A xB)n(AxC)cAx(BNC), so it follows that (Ax B)n(AxC)=Ax(BNC). ff 


Occasionally you can prove two sets are equal by working out a series of 
equalities leading from one set to the other. This is analogous to showing 
two algebraic expressions are equal by manipulating one until you obtain 
the other. We illustrate this in the following example, which gives an 
alternate solution to the previous example. You are cautioned that this 
approach is sometimes difficult to apply, but when it works it can shorten 
a proof dramatically. 

Before beginning the example, a note is in order. Notice that any 
statement P is logically equivalent to P \P. (Write out a truth table if you 
are in doubt.) At one point in the following example we will replace the 
expression x¢€ A with the logically equivalent statement (x € A) A (x€ A). 


Example 8.13 Given sets A, B, and C, prove A x(BNC)=(AxB)n(AxC),. 


Proof. Just observe the following sequence of equalities. 


Ax(BNC) = {(x,y):(@we A)A(yEBNC)} (def. of x) 
= {(x,y): (xe AVA(yEB)A(yEC)} (def. of n) 
= {(x,y) (xe ADA(KEA)A(VEB)A(yEC)} (P=P AP) 
= {(x,y): (xe AVA(y¥EB)) A(x E AVA(yEC))} (rearrange) 
= {(x,y): (xe A)A(yVEB)}n {(x,y) (aE AVA(yEC)} (def. of n) 
=(AxB)nN(AxC) (def. of x) 


The proof is complete. a 
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The equation A x(BNC) =(A x B)n(A xC) just obtained is a fundamental 
law that you may actually use fairly often as you continue with mathematics. 
Some similar equations are listed below. Each of these can be proved with 
this section’s techniques, and the exercises will ask that you do so. 


ANB=AUB 
AUB=AnB 
AN(BUC)=(ANB)U(ANC) 
AU(BNC)=(AUB)N(AUC) 
A x(BUC)=(A x B)U(A x C) 
Ax(BnC)=(A x B)n(A x C) 


DeMorgan’s laws for sets 
Distributive laws for sets 


Distributive laws for sets 


It is very good practice to prove these equations. Depending on your 
learning style, it is probably not necessary to commit them to memory. 
But don’t forget them entirely. They may well be useful later in your 
mathematical education. If so, you can look them up or re-derive them on 
the spot. If you go on to study mathematics deeply, you will at some point 
realize that you’ve internalized them without even being cognizant of it. 


8.4 Examples: Perfect Numbers 


Sometimes it takes a good bit of work and creativity to show that one set 
is a subset of another or that they are equal. We illustrate this now with 
examples from number theory involving what are called perfect numbers. 
Even though this topic is quite old, dating back more than 2000 years, it 
leads to some questions that are unanswered even today. 

The problem involves adding up the positive divisors of a natural 
number. To begin the discussion, consider the number 12. If we add up the 
positive divisors of 12 that are less than 12, we obtain 1+2+3+4+6=16, 
which is greater than 12. Doing the same thing for 15, we get 1+3+5=9 
which is less than 15. For the most part, given a natural number p, the 
sum of its positive divisors less than itself will either be greater than p 
or less than p. But occasionally the divisors add up to exactly p. If this 
happens, then p is said to be a perfect number. 


Definition 8.1 A number pé€N is perfect if it equals the sum of its 
positive divisors less than itself. Some examples follow. 

¢ The number 6 is perfect since 6=1+2+3. 

e The number 28 is perfect since 28 =1+2+4+7+14. 

¢ The number 496 is perfect since 496 = 1+2+4+8+16+31+624+124+ 248. 
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Though it would take a while to find it by trial-and-error, the next 
perfect number after 496 is 8128. You can check that 8128 is perfect. Its 
divisors are 1, 2, 4, 8, 16, 32, 64, 127, 254, 508, 1016, 2032, 4064 and indeed 


8128 =1+2+4+8+16+32+64+127+ 254+ 508 + 1016 + 2032 + 4064. 


Are there other perfect numbers? How can they be found? Do they obey any 
patterns? These questions fascinated the ancient Greek mathematicians. 
In what follows we will develop an idea—recorded by Euclid—that partially 
answers these questions. Although Euclid did not use sets,! we will 
nonetheless phrase his idea using the language of sets. 

Since our goal is to understand what numbers are perfect, let’s define 
the following set: 

P={peN: p is perfect}. 


Therefore P = {6,28,496,8128,...}, but it is unclear what numbers are in 
P other than the ones listed. Our goal is to gain a better understanding 
of just which numbers the set P includes. To do this, we will examine 
the following set A. It looks more complicated than P, but it will be very 
helpful for understanding P, as you will soon see. 


A = {2”-1(2” 1): neN, and 2”—1 is prime} 


In words, A consists of every natural number of form 2”~1(2” — 1), where 
2”-—1is prime. To get a feel for what numbers belong to A, look at the 
following table. For each natural number n, it tallies the corresponding 
numbers 2”~! and 2” —1. If 2”-—1 happens to be prime, then the product 
2”-1(2” — 1) is given; otherwise that entry is labeled with an «. 


nm | o8 4. || Fn | ot to" 4) 
1 1 1 x 

2 2 3 6 

3 4 7 28 

4 8 15 * 

5 16 31 496 

6 32 63 * 

7 64 127 8128 

8 | 128 | 255 * 

9 | 256 | 511 * 

10 | 512 | 1023 * 

11 | 1024 | 2047 * 

12 | 2048 | 4095 * 

13 | 4096 | 8191 | 33,550,336 


1Set theory was invented over 2000 years after Euclid died. 
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Notice that the first four entries of A are the perfect numbers 6, 28, 
496 and 8128. At this point you may want to jump to the conclusion that 
A=P. But it is a shocking fact that in over 2000 years no one has ever 
been able to determine whether or not A=P. But it is known that ACP, 
and we will now prove it. In other words, we are going to show that every 
element of A is perfect. (But by itself, that leaves open the possibility that 
there may be some perfect numbers in P that are not in A.) 

The main ingredient for the proof will be the formula for the sum of 
a geometric series with common ration r. You probably saw this most 
recently in Calculus II. The formula is 


n rrtl_y 
ort 
k=0 r-1 


We will need this for the case r = 2, which is 
n 
ae eee (8.1) 
h=0 


(See the solution for Exercise 19 in Section 7.4 for a proof of this for- 
mula.) Now we are ready to prove our result. Let’s draw attention to its 
significance by calling it a theorem rather than a proposition. 


Theorem 8.1 If A={2"-1(2”-1):neN, and 2”-1 is prime} and P = 
{peEN: p is perfect}, then ACP. 


Proof. Assume A and P are as stated. To show ACP, we must show that 
péA implies pe P. Thus suppose pe A. By definition of A, this means 


p=2" 12" -1) (8.2) 


for some né€N for which 2”-—1 is prime. We want to show that p«P, that 
is, we want to show p is perfect. Thus, we need to show that the sum of 
the positive divisors of p that are less than p add up to p. Notice that 
since 2”—1 is prime, any divisor of p = 2”-1(2”— 1) must have the form 2* 
or 2"(2"—1) for 0< k<n-1. Thus the positive divisors of p are as follows: 


2° 2" 2” ee aes ane 
90(0t 27), DIOP aN).. -92(2P 1): Pigg COE FOP TD). -B95107 = 1); 


Notice that this list starts with 2° =1 and ends with 2”"1(2”-1)=p. 
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If we add up all these divisors except for the last one (which equals p) 
we get the following: 


n 


-1 n-2 n-1 n-2 
2° + > 2*(2"—1) > 2? 4(2”-1) > 2" 
k=0 k=0 k=0 k=0 
= (2”-1)+(2"-1)(2”"!-1) (by Equation (8.1)) 
= [1+(2""!-1)1@"-1) 
= 2”-1(2"-1) 


Dp (by Equation (8.2)). 


This shows that the positive divisors of p that are less than p add up to p. 
Therefore p is perfect, by definition of a perfect number. Thus pe P, by 
definition of P. 

We have shown that pe A implies pe P, which means ACP. a 


Combined with the chart on the previous page, this theorem gives us 
a new perfect number! The element p = 2!?-1(2!° — 1) = 33,550,336 in A is 
perfect. 

Observe also that every element of A is a multiple of a power of 2, 
and therefore even. But this does not mean every perfect number is even, 
because we’ve only shown ACP, not A=P. Are there any odd perfect 
numbers? No one knows. 

In over 2000 years, no one has ever found an odd perfect number, nor 
has anyone been able to prove that every perfect number is even. But 
it is known that the set A does contain every even perfect number. This 
fact was first proved by Euler, and we duplicate his reasoning in the next 
theorem, which proves that A =F, where E is the set of all even perfect 
numbers. It is a good example of how to prove two sets are equal. 

For convenience, we are going to use a slightly different definition of a 
perfect number. A number p€N is perfect if its positive divisors add up 
to 2p. For example, the number 6 is perfect since the sum of its divisors 
is 1+2+3+6=2-6. This definition is simpler than the first one because 
we do not have to stipulate that we are adding up the divisors that are 
less than p. Instead we add in the last divisor p, and that has the effect 
of adding an additional p, thereby doubling the answer. 
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Theorem 8.2 If A= {2”-1(2”-1):neN, and 2”-1 is prime} and E = 
{peN: p is perfect and even}, then A=E. 


Proof. To show that A =E, we need to show ACE and ECA. 

First we will show that ACE. Suppose pe A. This means p is even, 
because the definition of A shows that every element of A is a multiple of 
a power of 2. Also, p is a perfect number because Theorem 8.1 states that 
every element of A is also an element of P, hence perfect. Thus p is an 
even perfect number, so p¢ E. Therefore ACE. 

Next we show that Ec A. Suppose pe FE. This means p is an even 
perfect number. Write the prime factorization of p as p = 2’3"15™7"..., 
where the powers nj, ng, n3... may be zero. But, as p is even, the integer 
k must be greater than zero. It follows p = 2"q for some positive integer k 
and an odd integer gq. Now, our aim is to show that pe A, which means we 
must show p has form p = 2”-1(2”-1). To get our current p =2"¢q closer to 
this form, let n =k+1, so we now have 


p=2" 1g. (8.3) 


List the positive divisors of g as d1,d2,d3,...,dm. (Where dj =1 and d,» =q.) 
Then the divisors of p are: 


29d, 2%do Dds ae As 
21dy 21do 21d oh 21d 
22d 22 do 27d3 ae 22dm 
23 d1 2? do 23 dz ei, dm 
oe lg, OP-lgy 28 *ds . vee an 


Since p is perfect, these divisors add up to 2p. By Equation (8.3), their 

sum is 2p = 2(2”-'q) =2”q. Adding the divisors column-by-column, we get 
n-1 n-1 n-1 n-1 
> dit ¥ 2*dot ¥ Qdgt--+ > dm = 2%¢. 
h=0 h=0 h=0 h=0 

Applying Equation (8.1), this becomes 


(2” — 1)dy + (2” — I)dg +(2" —1)dg +--+ +(2"—1)dm = 2"q 
(2” —1)(dy +dg+d3++:++dm)=2"q 
2” q 


dy+dg+dg+--+dm=>5,— 7: 
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so that 


(2"-1+1)q_ (2"-lq+q _ q 


+ + feeet = a a . 
oe a dm Qn] Qn] Qn] 


From this we see that 54, is an integer. It follows that both q and 54, 
are positive divisors of g. Since their sum equals the sum of all positive 
divisors of q, it follows that gq has only two positive divisors, g and wot: 
Since one of its divisors must be 1, it must be that oF = 1, which means 
q = 2”-1. Now a number with just two positive divisors is prime, so 
q=2”"-1is prime. Plugging this into Equation (8.3) gives p = 2”-1(2” — 1), 
where 2”—1 is prime. This means p€ A, by definition of A. We have now 
shown that pe E implies pe A, so ECA. 


Since ACE and ECA, it follows that A=E. a 


Do not be alarmed if you feel that you wouldn’t have thought of this 
proof. It took the genius of Euler to discover this approach. 
We'll conclude this chapter with some facts about perfect numbers. 


* The sixth perfect number is p = 2!’~1(2!" — 1) = 8589869056. 

¢ The seventh perfect number is p = 2!9-1(2!9 — 1) = 137438691328. 

* The eighth perfect number is p = 22! 1(291 — 1) = 2305843008139952128. 

¢ The twentieth perfect number is p = 2442°-1(24473 — 1). It has 2663 digits. 


¢ The twenty-third perfect number is p = 2'213-1(2718_ 1). It has 6957 
digits. 

As mentioned earlier, no one knows whether or not there are any odd 
perfect numbers. It is not even known whether there are finitely many or 
infinitely many perfect numbers. It is known that the last digit of every 
even perfect number is either a 6 or an 8. Perhaps this is something you’d 
enjoy proving. 

We’ve seen that perfect numbers are closely related to prime numbers 
that have the form 2”-1. Such prime numbers are called Mersenne 
primes, after the French scholar Marin Mersenne (1588-1648), who 
popularized them. The first several Mersenne primes are 27-1 = 3, 
23-1=7, 2°-1=31, 27-1=127 and 2!-1= 8191. To date, only 
47 Mersenne primes are known, the largest of which is 24%1!2-69 — 1, 
There is a substantial cash prize for anyone who finds a 48th. (See 
http://www.mersenne.org/prime.htm.) You will probably have better luck 
with the exercises. 
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Exercises for Chapter 8 
Use the methods introduced in this chapter to prove the following statements. 


. Prove that {12n:neZ}c{2n:neZ}n{3n:ne Z}. 

. Prove that {6n:neZ}= {2n:neZ}n{3n:ne Z}. 

. IfkeZ, then {ne Z:n|k}o{neZ:n|k*}. 

. If m,neZ, then {xe Z:mn|x}c{xeZ:m|xba{xeZ:n|x}. 

. If p and q are positive integers, then {pn:neN}n{qn:neN}Z@. 

. Suppose A,B and C are sets. Prove that if ACB, then A-CCB-C. 
. Suppose A,B and C are sets. If BCC, then Ax BC AxC. 

If A,B and C are sets, then AU(BNC)=(AUB)N(AUC). 

. If A,B and C are sets, then AN(BUC)=(AnNB)U(ANC). 


CONIA HP wD 


NNN ND &— = & SB eS eB eS Se 
Oonormeoccdbdmrmassranahwnwn = © 


iw) 
~ 


ow nonwnN WS NW 
me owenmnn DOH 


. Prove that (\[3—x?,5+x71=[3,51. 


xeR 


. If A and B are sets in a universal set U, then ANB=AUB. 
. If A and B are sets in a universal set U, then AUB=ANB. 
. If A,B and C are sets, then A—(BNC)=(A-B) 
. If A,B and C are sets, then A—(BUC)=(A-B) 
. If A,B and C are sets, then (AUB)—C =(A-C) 
. If A,B and C are sets, then (AnB)—-C=(A-C) 
. If A,B and C are sets, then A x (BUC) =(A x B) 
. If A,B and C are sets, then A x (BNC)=(A x B) 
. If A,B and C are sets, then A x (B—C)=(A x B) 
. Prove that {9”:neZ}<{3":neZ}, but {9”:neZ} 4 {3":neZz} 
. Prove that {9” :n € Q} = {3":ne Q}. 
. Suppose A and B are sets. Prove ACB if and only if A-B=@. 

. Let A and B be sets. Prove that A CB if and only if ANB=A. 

. For each a€R, let Ag = {(x,a(x?-1))€ R?: xe R}. Prove that (] Aa = {(-1,0),(1,0)}. 


U(A-C). 
n(A-C). 
U(B-C). 
n(B-C). 
U(A x C). 
n(A x C). 
-(AxC). 


. Suppose A,B,C and D are sets. Prove that (A x B)U(C x D) < (AUC) x (BUD). 
. Prove {4k+5:keZ}={4k+1:keZ}. 

. Prove {12a+4b:a,be Z} = {4c:ceZ}. 

. Prove {12a+25b:a,beZ}=Z. 

. Suppose AZ @. Prove that Ax BC Ax C, if and 
. Prove that (Z x N)n(Nx Z)=NxN. 

. Suppose B4¢ and Ax BCBxC. Prove AEC. 


only if BCC. 


CHAPTER 9 


Disproof 


Rhy” since Chapter 4 we have dealt with one major theme: Given a 
statement, prove that is it true. In every example and exercise we 
were handed a true statement and charged with the task of proving it. 
Have you ever wondered what would happen if you were given a false 
statement to prove? The answer is that no (correct) proof would be possible, 
for if it were, the statement would be true, not false. 

But how would you convince someone that a statement is false? The 
mere fact that you could not produce a proof does not automatically mean 
the statement is false, for you know (perhaps all too well) that proofs 
can be difficult to construct. It turns out that there is a very simple and 
utterly convincing procedure that proves a statement is false. The process 
of carrying out this procedure is called disproof. Thus, this chapter is 
concerned with disproving statements. 

Before describing the new method, we will set the stage with some 
relevant background information. First, we point out that mathematical 
statements can be divided into three categories, described below. 

One category consists of all those statements that have been proved to be 
true. For the most part we regard these statements as significant enough 
to be designated with special names such as “theorem,” “proposition,” 
“lemma” and “corollary.” Some examples of statements in this category are 
listed in the left-hand box in the diagram on the following page. There are 
also some wholly uninteresting statements (such as 2 = 2) in this category, 
and although we acknowledge their existence we certainly do not dignify 
them with terms such as “theorem” or “proposition.” 

At the other extreme is a category consisting of statements that are 
known to be false. Examples are listed in the box on the right. Since 
mathematicians are not very interested in them, these types of statements 
do not get any special names, other than the blanket term “false statement.” 

But there is a third (and quite interesting) category between these 
two extremes. It consists of statements whose truth or falsity has not 
been determined. Examples include things like “Every perfect number 
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is even,” or “Every even integer greater than 2 is the sum of two primes. 
(The latter statement is called the Goldbach conjecture. See Section 2.1.) 
Mathematicians have a special name for the statements in this category 
that they suspect (but haven’t yet proved) are true. Such statements are 


called conjectures. 


THREE TYPES OF STATEMENTS: 


Known to be true 
(Theorems & propositions) 


Truth unknown 


(Conjectures) 


Known to be false 


Examples: 


e Pythagorean theorem 


e Fermat’s last theorem 
(Section 2.1) 


e The square of an odd 
number is odd. 


CO 

1 

¢ The series )' — diverges. 
kak 


Examples: 


All perfect numbers are 
even. 


Any even number greater 
than 2 is the sum of two 
primes. (Goldbach’s 
conjecture, Section 2.1) 


e There are infinitely many 


prime numbers of form 


Examples: 


e All prime numbers are 
odd. 

e Some quadratic equations 
have three solutions. 

- 0-1 


e There exist natural 
numbers a,b and c 
for which a? +b? = c?. 


2” -1, with neN. 


Mathematicians spend much of their time and energy attempting 
to prove or disprove conjectures. (They also expend considerable mental 
energy in creating new conjectures based on collected evidence or intuition.) 
When a conjecture is proved (or disproved) the proof or disproof will 
typically appear in a published paper, provided the conjecture is of sufficient 
interest. If it is proved, the conjecture attains the status of a theorem or 
proposition. If it is disproved, then no one is really very interested in it 
anymore—mathematicians do not care much for false statements. 

Most conjectures that mathematicians are interested in are quite 
difficult to prove or disprove. We are not at that level yet. In this text, the 
“conjectures” that you will encounter are the kinds of statements that an 
experienced mathematician would immediately spot as true or false, but 
you may have to do some work before figuring out a proof or disproof. But 
in keeping with the cloud of uncertainty that surrounds conjectures at the 
advanced levels of mathematics, most exercises in this chapter (and many 
beyond it) will ask you to prove or disprove statements without giving any 
hint as to whether they are true or false. Your job will be to decide whether 
or not they are true and to either prove or disprove them. The examples 
in this chapter will illustrate the processes one typically goes through in 
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deciding whether a statement is true or false, and then verifying that it’s 
true or false. 

You know the three major methods of proving a statement: direct proof, 
contrapositive proof and proof by contradiction. Now we are ready to 
understand the method of disproving a statement. Suppose you want to 
disprove a statement P. In other words you want to prove that P is false. 
The way to do this is to prove that ~ P is true, for if ~ P is true, it follows 
immediately that P has to be false. 


How to disprove P: Prove ~P. 


Our approach is incredibly simple. To disprove P, prove ~ P. In theory, 
this proof can be carried out by direct, contrapositive or contradiction 
approaches. However, in practice things can be even easier than that 
if we are disproving a universally quantified statement or a conditional 
statement. That is our next topic. 


9.1 Disproving Universal Statements: Counterexamples 


A conjecture may be described as a statement that we hope is a theorem. 
As we know, many theorems (hence many conjectures) are universally 
quantified statements. Thus it seems reasonable to begin our discussion 
by investigating how to disprove a universally quantified statement such as 


VxeS,P(x). 
To disprove this statement, we must prove its negation. Its negation is 
~(VxeES,P(x)) = AxeS,~ P(x). 


The negation is an existence statement. To prove the negation is true, 
we just need to produce an example of an x¢€ S that makes ~ P(x) true, 
that is, an x that makes P(x) false. This leads to the following outline for 
disproving a universally quantified statement. 


How to disprove Vx « S,P(x). 


Produce an example of an xe S 
that makes P(x) false. 
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Things are even simpler if we want to disprove a conditional statement 
P(x) > Q(x). This statement asserts that for every x that makes P(x) true, 
Q(x) will also be true. The statement can only be false if there is an x that 
makes P(x) true and Q(x) false. This leads to our next outline for disproof. 


How to disprove P(x) > Q(x). 


Produce an example of an x that 
makes P(x) true and Q(x) false. 


In both of the above outlines, the statement is disproved simply by 
exhibiting an example that shows the statement is not always true. (Think 
of it as an example that proves the statement is a promise that can be 
broken.) There is a special name for an example that disproves a statement: 
It is called a counterexample. 


Example 9.1 As our first example, we will work through the process of 
deciding whether or not the following conjecture is true. 


Conjecture: For every n€« Z, the integer f(n) =n? —n+11 is prime. 


In resolving the truth or falsity of a conjecture, it’s a good idea to gather 
as much information about the conjecture as possible. In this case let’s 
start by making a table that tallies the values of f(n) for some integers n. 


n | -3 —2 -1 0 1 2 38 4 5 6 7 8 9 10 


fin) | 28 17 #138 #%11 #11 =é18~«#«217~«23 «031 «41 «#253 «#67 «83 = «101 


In every case, f(n) is prime, so you may begin to suspect that the conjecture 
is true. Before attempting a proof, let’s try one more n. Unfortunately, 
f(11) = 117-11+11= 11? is not prime. The conjecture is false because n = 11 
is a counterexample. We summarize our disproof as follows: 


Disproof. The statement “For every n€ Z, the integer f(n)=n2-—n+11 is 
prime,” is false. For a counterexample, note that for n = 11, the integer 
f(11)=121=11-11 is not prime. a 


In disproving a statement with a counterexample, it is important to explain 
exactly how the counterexample makes the statement false. Our work 
would not have been complete if we had just said “for a counterexample, 
consider n= 11,” and left it at that. We need to show that the answer f(11) 
is not prime. Showing the factorization f(11)= 11-11 suffices for this. 
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Example 9.2 Either prove or disprove the following conjecture. 


Conjecture If A,B and C are sets, then A-—(BNC)=(A-B)n(A-C). 


Disproof. This conjecture is false because of the following counterexample. 
Let A = {1,2,3}, B= {1,2} and C = {2,3}. Notice that A—(BnNC) = {1,3} and 
(A-B)nN(A-C)=@, so A-(BNC)A(A-B)N(A-C). BH 


(To see where this counterexample came from, draw Venn diagrams for 
A-(BnC) and (A-B)n(A-C). You will see that the diagrams are different. 
The numbers 1, 2 and 3 can then be inserted into the regions of the 
diagrams in such a way as to create the above counterexample.) 


9.2 Disproving Existence Statements 


We have seen that we can disprove a universally quantified statement or a 
conditional statement simply by finding a counterexample. Now let’s turn 
to the problem of disproving an existence statement such as 


dxeS,P(x). 


Proving this would involve simply finding an example of an x that makes 
P(x) true. To disprove it, we have to prove its negation ~ (Ax € S,P(x)) = 
Vx eS,~ P(x). But this negation is universally quantified. Proving it 
involves showing that ~ P(x) is true for all xe S, and for this an example 
does not suffice. Instead we must use direct, contrapositive or contradiction 
proof to prove the conditional statement “If x¢S, then ~ P(x).” As an 
example, here is a conjecture to either prove or disprove. 


Example 9.3 Either prove or disprove the following conjecture. 


Conjecture: There is a real number x for which x* <x <x?. 


This may not seem like an unreasonable statement at first glance. After 
all, if the statement were asserting the existence of a real number for 
which x? <x <x?, then it would be true: just take x = —2. But it asserts 
there is an x for which x* <x <x”. When we apply some intelligent guessing 
to locate such an x we run into trouble. If x= 55 then x* <x, but we don’t 
have x <x?; similarly if x = 2, we have x <x? but not x+ <x. Since finding 
an x with x*+ <x <x? seems problematic, we may begin to suspect that the 
given statement is false. 

Let’s see if we can disprove it. According to our strategy for disproof, 
to disprove it we must prove its negation. Symbolically, the statement is 
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4xe€R,x*<x<x?, so its negation is 


~ (Ax é€R,x* <x <x”) = VxER,~ (x4 <x<x?). 


Thus, in words the negation is: 


For every real number x, it is not the case that x* <x <x?. 


This can be proved with contradiction, as follows. Suppose for the 
sake of contradiction that there is an x for which x* <x<.x?. Then x must 
be positive since it’s greater than the non-negative number x*. Dividing 
all parts of x4 <x <x? by the positive number x produces x° <1<x. Now 
subtract 1 from all parts of x? <1<-x to obtain x«?-1<0<x-1 and reason 
as follows: 


x-1 <0< x-1 
(x—-1(x?+x4+1) <0 < (x-1) 


xetxt1 <O0< 1 


(Division by x-—1 did not reverse the inequality < because the second line 
above shows 0<x-1, that is, x—1 is positive.) Now we have x2+x+1<0, 
which is a contradiction because x is positive. 

We summarize our work as follows. 

The statement “There is a real number x for which x* <x <x?” is false 
because we have proved its negation “For every real number x, it not the 
case that x4<x<x?.” 


As you work the exercises, keep in mind that not every conjecture will be 
false. If one is true, then a disproof is impossible and you must produce a 
proof. Here is an example: 


Example 9.4 Either prove or disprove the following conjecture. 


Conjecture There exist three integers x,y,z, all greater than 1 and no 
two equal, for which x = y’. 


This conjecture is true. It is an existence statement, so to prove it we 
just need to give an example of three integers x,y,z, all greater than 1 and 
no two equal, so that x” = y*. A proof follows. 


Proof. Note that if x =2, y=16 and z =4, then x” = 216 =(2*)4=164=,7. @ 
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9.3 Disproof by Contradiction 


Contradiction can be a very useful way to disprove a statement. To see 
how this works, suppose we wish to disprove a statement P. We know 
that to disprove P, we must prove ~ P. To prove ~ P with contradiction, 
we assume ~~ P is true and deduce a contradiction. But since ~~ P =P, 
this boils down to assuming P is true and deducing a contradiction. Here 
is an outline: 


How to disprove P with contradiction: 


Assume P is true, and deduce a contradiction. 


To illustrate this, let’s revisit Example 9.3 but do the disproof with 
contradiction. You will notice that the work duplicates much of what we 
did in Example 9.3, but is it much more streamlined because here we do 
not have to negate the conjecture. 


Example 9.5 Disprove the following conjecture. 


Conjecture: There is a real number x for which x* <x <x?. 


Disproof. Suppose for the sake of contradiction that this conjecture is true. 
Let x be a real number for which x* <x <x”. Then x is positive, since it is 
greater than the non-negative number x*. Dividing all parts of x* <x < x? 
by the positive number «x produces x? < 1<x. Now subtract 1 from all parts 
of x? <1<-x to obtain x? -1<0<«-—1 and reason as follows: 


x®-1 <0< x-1 
(x—-1)(x?+x4+1) <0 < («-1) 


xetxtl <O0< 1 


Now we have x?+x+1<0, which is a contradiction because «x is positive. 
Thus the conjecture must be false. a 


Exercises for Chapter 9 


Each of the following statements is either true or false. If a statement is true, 
prove it. If a statement is false, disprove it. These exercises are cumulative, 
covering all topics addressed in Chapters 1-9. 


1. If x,yeR, then |x+y| = |x|+|yI. 
2. For every natural number n, the integer 2n?— 4n+31 is prime. 
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13. 
14. 
15. 
16. 
17. 
18. 
19. 
20. 
21. 
22. 
23. 
24. 
25. 
26. 
27. 
28. 
29. 
30. 
31. 
32. 
33. 


34. 


. IfneZ and n®*—n is even, then n is even. 


For every natural number n, the integer n?+17n+17 is prime. 
If A, B,C and D are sets, then (A x B)U(C x D) = (AUC) x (BUD). 
If A, B,C and D are sets, then (A x B)n(C x D) =(ANC) x (BnD). 
If A, B and C are sets, and Ax C=BxC, then A=B. 

If A, B and C are sets, then A—(BUC)=(A-B)U(A-C). 


. If A and B are sets, then A(A)—- A(B)S A(A-B). 
10. 
11. 
12. 


If A and B are sets and AnNB=@, then A(A)- A(B)S AA-B). 
If a,beN, then a+b<ab. 


If a,b,c €N and ab, bc and ac all have the same parity, then a,b and c¢ all have 
the same parity. 


There exists a set X for which RC X and Ge X. 

If A and B are sets, then A(A)n A(B) = AANB). 

Every odd integer is the sum of three odd integers. 

If A and B are finite sets, then |A UB|=|A|+|BI. 

For all sets A and B, if A-B=@, then B#@. 

If a,b,c EN, then at least one of a—b, a+c and b—c is even. 

For every r,s € Q with r<s, there is an irrational number u for which r<u<s. 
There exist prime numbers p and gq for which p — gq = 1000. 

There exist prime numbers p and q for which p —q = 97. 

If p and q are prime numbers for which p <q, then 2p + q? is odd. 

If x,ye€R and x°< y®, then x<y. 

The inequality 2* =>x+1 is true for all positive real numbers x. 

For all a,b,ceé Z, if a|bc, then a|b or ale. 

Suppose A, B and C are sets. If A=B-C, then B=AUC. 

The equation x? = 2” has three real solutions. 

Suppose a,be Z. If a|b and bla, then a=b. 

If x,yeR and |x+y|=|x—yl|, then y=0. 

There exist integers a and b for which 42a + 7b =1. 

No number (other than 1) appears in Pascal’s triangle more than four times. 
If n,keN and (%) is a prime number, then k=1 or k=n-1. 


Suppose f(x) = a9 +a 1x+a9x2+---+a,x” is a polynomial of degree 1 or greater, 


and for which each coefficient a; is in N. Then there is an n€N for which the 
integer f(n) is not prime. 


If X CAUB, then XCA or X CB. 


CHAPTER 10 


Mathematical Induction 


his chapter explains a powerful proof technique called mathematical 

induction (or just induction for short). To motivate the discussion, 

let’s first examine the kinds of statements that induction is used to prove. 
Consider the following statement. 


Conjecture. The sum of the first n odd natural numbers equals n?. 


The following table illustrates what this conjecture says. Each row 
is headed by a natural number n, followed by the sum of the first n odd 
natural numbers, followed by n?. 


n | sum of the first n odd natural numbers | n? 
Deo) DS oe eke he ecececs is eee a eee de 1 
yaa (hal es ies eee a ee ee me 4 
Oe | MB eS cadee ocr inate apy Be cree rena Gratesaneeasees 9 
A A SED + TD Sis ek Ae ahead wade iene 16 
5B | 148454 749 = coe ccc ccc cee cece 25 
n | 14+3454+74+9411+---+(2n-De=....... n? 


Note that in the first five lines of the table, the sum of the first n odd 
numbers really does add up to n?. Notice also that these first five lines 
indicate that the n” odd natural number (the last number in each sum) 
is 2n-1. (For instance, when n = 2, the second odd natural number is 
2-2-—1=8; when n=83, the third odd natural number is 2:3-1=5, etc.) 

The table raises a question. Does the sum 1+3+5+7+---+(2n-—1) really 
always equal n?? In other words, is the conjecture true? 

Let’s rephrase this as follows. For each natural number n (i.e., for each 
line of the table), we have a statement S,,, as follows: 
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S,:1=1 
So:14+3=2? 
S3:14+3+5=837 


Sn t14+3454+74+---+(2n-lD=n? 


Our question is: Are all of these statements true? 

Mathematical induction is designed to answer just this kind of question. 
It is used when we have a set of statements S1,S9,S3,...,S,,..., and we 
need to prove that they are all true. The method is really quite simple. 
To visualize it, think of the statements as dominoes, lined up in a row. 
Imagine you can prove the first statement S,, and symbolize this as 
domino S; being knocked down. Additionally, imagine that you can prove 
that any statement S;, being true (falling) forces the next statement S;,1 
to be true (to fall). Then S, falls, and knocks down S2. Next Se falls and 
knocks down S3, then Ss knocks down S4, and so on. The inescapable 
conclusion is that all the statements are knocked down (proved true). 


The Simple Idea Behind Mathematical Induction 


Then all must fall (i.e. all statements are proved true). 
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This picture gives our outline for proof by mathematical induction. 


Outline for Proof by Induction 
Proposition The statements S1,S2,S3,S4,... are all true. 


Proof. (Induction) 

(1) Prove that the first statement Sj is true. 

(2) Given any integer k =1, prove that the statement S; > S;,, is true. 
It follows by mathematical induction that every S, is true. a 


In this setup, the first step (1) is called the basis step. Because Sj is 
usually a very simple statement, the basis step is often quite easy to do. 
The second step (2) is called the inductive step. In the inductive step 
direct proof is most often used to prove Sz; > Sz11, so this step is usually 
carried out by assuming S; is true and showing this forces Sz, to be true. 
The assumption that S; is true is called the inductive hypothesis. 

Now let’s apply this technique to our original conjecture that the sum 
of the first n odd natural numbers equals n?. Our goal is to show that for 
each néN, the statement S,:1+3+5+7+---+(2n-1)=n? is true. Before 
getting started, observe that S; is obtained from S,, by plugging & in for n. 
Thus S;, is the statement S,:1+3+5+7+---+(2k-1)=k?. Also, we get Sy41 
by plugging in k+1 for n, so that S,41:1+34+5+7+-:-+(2k+1)-1)=(k+1). 


Proposition Ifne¢N, then 14+3+5+7+---+(2n-1)=n?. 
Proof. We will prove this with mathematical induction. 
(1) Observe that if n =1, this statement is 1 = 17, which is obviously true. 


(2) We must now prove Sz; > Sz11 for any k=1. That is, we must show 
that if 14+3+5+7+---+(2k-1)=k?, then 1+3+5+7+-:-+(2(k+1)-1) =(k+1)?. 
We use direct proof. Suppose 1+3+5+7+---+(2k-1)=k?. Then 


g Rete ey os ay BN errr ee Pee +(2k+1)-1) = 
14+34+54+7+---+ (2k-1)+(2k+1)-1) = 
(1+34+54+7+---+(2k-1)+(2QR+1)-1) = 

k2 +(2k+1)-1) = hk? +2k4+1 
(k+1). 


Thus 1+34+5+7+---+(2k+1)-1)=(k+1)?. This proves that S; > Sp41. 


It follows by induction that 1+3+5+7+---+(2n-1)=n? foreveryneN. @ 
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In induction proofs it is usually the case that the first statement 
S; is indexed by the natural number 1, but this need not always be so. 
Depending on the problem, the first statement could be So, or S,, for any 
other integer m. In the next example the statements are So,S1,S2,S3.... 
The same outline is used, except that the basis step verifies So, not Sj. 


Proposition If 7 is a non-negative integer, then 5|(n°—-7n). 


Proof. We will prove this with mathematical induction. Observe that the 
first non-negative integer is 0, so the basis step involves n =0. 


(1) If n=0, this statement is 5|(0°-—0) or 5|0, which is obviously true. 


(2) Let k=>0. We need to prove that if 5|(k°—&), then 5|((k+ 1)°-(k +1)). 
We use direct proof. Suppose 5|(k°—zk). Thus k®—k =5a for some a€ Z. 
Observe that 


(k+1)?-(R+1) = kh? +5k*+10k°+10k74+5k+1-k-1 
= (k°-k)+5k*+10k?+ 10k? +5k 
= 5at+5k*+10k°+10k2+5k 
= Blat+hk*+2k? + 2k" +k). 


This shows (k+1)°-—(k+1) is an integer multiple of 5, so 5|((k+1)°-(k+1)). 
We have now shown that 5|(k°—) implies 5|((k + 1)° -(k + 1)). 


It follows by induction that 5|(n®>—7n) for all non-negative integers n. 


As noted, induction is used to prove statements of the form VneEN,Sy. 
But notice the outline does not work for statements of form Vn €« Z,S, 
(where n is in Z, not N). The reason is that if you are trying to prove 
YneéZ,S, by induction, and you’ve shown S, is true and S; > Sz41, then 
it only follows from this that S, is true for n=1. You haven’t proved 
that any of the statements So,S_1,S_9,... are true. If you ever want to 
prove VneéZ,S, by induction, you have to show that some S, is true and 
Sz > Spit and Sz > Sp-1. 

Unfortunately, the term mathematical induction is sometimes confused 
with inductive reasoning, that is, the process of reaching the conclusion 
that something is likely to be true based on prior observations of similar 
circumstances. Please note that that mathematical induction, as intro- 
duced here, is a rigorous method that proves statements with absolute 
certainty. 
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To round out this section, we present four additional induction proofs. 


n 
Proposition Ifne¢Z and 7n=0, then )i-i!=(n+D!-1. 
i=0 


Proof. We will prove this with mathematical induction. 
(1) If n=0, this statement is 


0 
Vi-i!=O+D!-1. 
i=0 


Since the left-hand side is 0-0!=0, and the right-hand side is 1!-1=0, 
the equation yogi =(0+1)!-—1 holds, as both sides are zero. 

(2) Consider any integer k =>0. We must show that Sz implies Sz. That 
is, we must show that 


k 
Yi-il=(k+D!-1 
i=0 


implies 
k+1 
Yo i-i!=(R+)4D!-1. 
i=0 


hk 
We use direct proof. Suppose )i-i!=(k+1)!-1. Observe that 
i=0 


k+1 k 
yak = [3a] «ces nee 
i=0 


i=0 
= (e+0!-1) ++ D+! 
= (k+1)!+(R+1)(R4+1)!-1 
= (1+(+D)(&+D!-1 
= (k+2\(k+D!-1 
= (k+2)!-1 
= ((k+1)+D!-1. 
k+1 


Therefore ) i-i! = ((k+1)+1)!-1. 
i=0 


n 
It follows by induction that )°i-i!=(n+1)!-1 for every integern>0. 
i=0 
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The next example illustrates a trick that you may occasionally find 
useful. You know that you can add equal quantities to both sides of an 
equation without violating equality. But don’t forget that you can add 
unequal quantities to both sides of an inequality, as long as the quantity 
added to the bigger side is bigger than the quantity added to the smaller 
side. For example, if x< y and a<b, then x+a<y+b. Similarly, if x <y 
and b is positive, then x< y+b. This oft-forgotten fact is used in the next 
proof. 


Proposition For each néN, it follows that 2” <2"t!-2"-1-1., 
Proof. We will prove this with mathematical induction. 


(1) If n =1, this statement is 2! < 2!*4-2!1-1, which simplifies to 
2<4-1-1, which is obviously true. 


(2) Suppose k= 1. We need to show that 2” < 2+! 9*-1_1 implies 
Qk+l < ok+I)+1_9(k+))-1_1, We use direct proof. Suppose 2” < 2?+1!—-2’-1_1, 
and reason as follows: 


gk < gktl_ok-1_y 
a2") < 2(2*t1-2'-1_1) (multiply both sides by 2) 
gkt1 < gkt2_ok_9 
getl < gkt2_o9k_944 (add 1 to the bigger side) 
It follows by induction that 2” < 2"*1-2”"-1!_1 for each neEN. a 


The next proposition proves that if ne N, then the inequality (1+x)” = 
1+nx holds for all x€R with x >—-1. Thus we will need to prove that the 
statement 


S,:(1+x)”=1+nx for every x€R with x>-1 


is true for every natural number n. (For the record, the inequality (1+x)” = 
1+nx is sometimes known as Bernoulli’s inequality.) 
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Proposition If neéN, then (1+x)” =1+nx for all xe R with x >-1. 


Proof. We will prove this with mathematical induction. 


(1) For the basis step, notice that when n=1 the statement is (1+x)'= 
1+1-x, and this is true because both sides equal 1+ x. 


(2) Assume that for some k = 1, the statement (1+x)*>1+k«x is true for 
all x€R with x >-1. From this we need to prove (1+ x)*t!>1+(k+ x. 
Now, 1+. is positive because x >-—1, so we can multiply both sides of 
(1+x)* =>1+kx by (1+) without changing the direction of the =. 


(1+x)*(1+x) = (1+kx)(1+x) 
yerr 


V 


> lixthxt+hx? 
tl Ss 14 (k41)x + kx? 


(1+x 
(1+x 


The above term kx? is positive, so removing it from the right-hand side 
will only make that side smaller. Thus we get (l+x)**!>1+(k+1)x. 


Next, an example where the basis step involves more than routine 
checking. 


Proposition Suppose aj,a2,...,a, are n integers, where n= 2. If p is 
prime and p|(a1-a2-a3:--ap), then p|a; for at least one of the a;. 


Proof. The proof is induction on n. 


(1) The basis step involves n= 2. Let p be prime and suppose p|(a,a2). 
We need to show that p|aj, or p|ag, or equivalently, if p{a1, then 
p|ag. Thus suppose p{a,. Since p is prime, it follows that gcd(p,a1)= 1. 
By the proposition on page 124, there are integers k and ¢ for which 
1=pk+a;é. Multiplying this by ag gives 


ag = pkagnt+ajagl. 


As we are assuming that p divides aja, it is clear that p divides the 
expression pkag+a af on the right; hence p|az. We’ve now proved that 
if p | (a ag), then p|a, or p|ag. This completes the basis step. 


(2) Suppose that k = 2, and p|(a,-a2---az) implies then p|a; for some aj. 
Now let p | (a1-a2---ag-ap41). Then p| ((ai1-a2---az)-a%41). By what we 
proved in the basis step, it follows that p |(a,-a2---az) or p|apz41. This 
and the inductive hypothesis imply that p divides one of the aj. a 


Please test your understanding now by working few exercises. 
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10.1 Proof by Strong Induction 


This section discribes a useful variation on induction. 

Occasionally it happens in induction proofs that it is difficult to show 
that S; forces S;.; to be true. Instead you may find that you need to use 
the fact that some “lower” statements S,, (with m < k) force S;,1 to be true. 
For these situations you can use a slight variant of induction called strong 
induction. Strong induction works just like regular induction, except that 
in Step (2) instead of assuming S; is true and showing this forces S;,1 
to be true, we assume that all the statements S1,So,...,S, are true and 
show this forces Sz; to be true. The idea is that if it always happens that 
the first k dominoes falling makes the (k+1)*” domino fall, then all the 
dominoes must fall. Here is the outline. 


Outline for Proof by Strong Induction 
Proposition The statements S1,S2,S3,S4,... are all true. 


Proof. (Strong induction) 
(1) Prove the first statement S;. (Or the first several S,,.) 
(2) Given any integer k =1, prove (Sj; A SegAS3A-:-AS,)>Sp41. Wf 


Strong induction can be useful in situations where assuming S;, is true 
does not neatly lend itself to forcing S;,1 to be true. You might be better 
served by showing some other statement (S;_1; or S;-2 for instance) forces 
S; to be true. Strong induction says you are allowed to use any (or all) of 
the statements S1,So,...,S, to prove Sz+1. 

As our first example of strong induction, we are going to prove that 
12 |(n4—n?) for any ne N. But first, let’s look at how regular induction 
would be problematic. In regular induction we would start by showing 
12|(n* —n?) is true for n=1. This part is easy because it reduces to 12|0, 
which is clearly true. Next we would assume that 12|(k+—k?) and try to 
show this implies 12|((k+1)*—(k+1)?). Now, 12|(k*—k?) means k4—k? = 12a 
for some a¢€ Z. Next we use this to try to show (k + 1)*-(k +1)? = 126 for 
some integer b. Working out (k + 1)4—(k+ 1)”, we get 


(k+1)*-(k+1)? = (k44+4k?+6k24+4k+1)—(k274+2k41) 


(k* — kh?) + 4k? + 6k? +6k 
12a +4k? + 6k? + 6k. 


At this point we’re stuck because we can’t factor out a 12. Now let’s see 
how strong induction can get us out of this bind. 
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Strong induction involves assuming each of statements Sj,S9,...,S, is 
true, and showing that this forces S;,; to be true. In particular, if S; 
through S, are true, then certainly S;_5 is true, provided that 1<k-5<k. 
The idea is then to show S;_5 > S;41 instead of S; > S,,1. For this to 
make sense, our basis step must involve checking that S1,S2,S3,S4,S5,S¢ 
are all true. Once this is established, S;,_5 > S,+1 will imply that the other 
S;, are all true. For example, if k =6, then S;_5 > S;41 is Sj > S87, so S7 is 
true; for k=7, then Sz_5 > Sz41 is So > Sg, so Sg is true, etc. 


Proposition If neéN, then 12| (n* —n?). 
Proof. We will prove this with strong induction. 


(1) First note that the statement is true for the first six positive integers: 
If n =1, 12 divides n*-n? = 14-17 =0. 
If n =2, 12 divides n* — n? = 24-2? = 12. 
If n =3, 12 divides n*—n? = 34 - 32 =72. 
If n=4, 12 divides n4-—n? = 44 — 4? = 240. 
If n=5, 12 divides n*—n? = 54-57 = 600. 
If n =6, 12 divides n*—n? = 64 — 6? = 1260. 
(2) Let k=>6 and assume 12|(m*-m?) for 1<m<k. (That is, assume 
statements $1,S2,...,S; are all true.) We must show 12| ((k+1)4-(k+ 1”). 


(That is, we must show that S;,, is true.) Since Sz_5 is true, we have 
12|((k—5)* —-(k—5)?). For simplicity, let’s set | m=k-5, | so we know 


12|(m4—m?), meaning| m*—m?= 12a | for some integer a. Observe that: 


(k+1)*-(k+1)? = (m+6)*-(m+6)" 
= m*+24m? +216m? + 864m + 1296 —(m? + 12m + 36) 
= (m*—m?)+24m? + 216m? + 852m + 1260 
= 12a+24m? + 216m? + 852m +1260 
= 12(a+2m?+18m?+71m +105). 


As (a+ 2m? +18m?2+71m+105) is an integer, we get 12|((k+1)*-—(k +1)?). 


This shows by strong induction that 12 |(n+—n?) for every ne€N. a 
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Our next example involves mathematical objects called graphs. In 
mathematics, the word graph is used in two contexts. One context involves 
the graphs of equations and functions from algebra and calculus. In 
the other context, a graph is a configuration consisting of points (called 
vertices) and edges which are lines connecting the vertices. Following 
are some pictures of graphs. Let’s agree that all of our graphs will be in 
“one piece,” that is, you can travel from any vertex of a graph to any other 
vertex by traversing a route of edges from one vertex to the other. 


rR EN 


Figure 10.1. Examples of Graphs 


A cycle in a graph is a sequence of distinct edges in the graph that 
form a route that ends where it began. For example, the graph on the 
far left of Figure 10.1 has a cycle that starts at vertex v;, then goes to vg, 
then to vs, then v4 and finally back to its starting point v1. You can find 
cycles in both of the graphs on the left, but the two graphs on the right do 
not have cycles. There is a special name for a graph that has no cycles; 
it is called a tree. Thus the two graphs on the right of Figure 10.1 are 
trees, but the two graphs on the left are not trees. 


Figure 10.2. A tree 


Our next example concerns trees. Note that the trees in Figure 10.1 
both have one fewer edge than vertices. For example, the tree on the far 
right has 5 vertices and 4 edges. The one next to it has 6 vertices and 5 
edges. Draw any tree; you will find that if it has n vertices, then it has 
n—1 edges. We now prove that this is true for any tree. 
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Proposition Ifa tree has n vertices, then it has n-1 edges. 


Proof. Notice that this theorem asserts that for any néN, the following 
statement is true: S,: A tree with n vertices has n—1 edges. We use strong 
induction to prove this. 
(1) Observe that if a tree has n = 1 vertex then it has no edges. Thus it 
has n-1=0 edges, so the theorem is true when n= 1. 


(2) Now take an integer k=>1. We must show (S1AS2/A-:-A Sz) > Spi 1. 
In words, we must show that if it is true that any tree with m vertices 
has m-1 edges, where 1<m<zk, then any tree with k+1 vertices has 
(k+1)—-1=k edges. We will use direct proof. 


Suppose that for each integer m with 1<m<zk, any tree with m vertices 
has m-1 edges. Now let T be a tree with k +1 vertices. Single out an 
edge of T and label it e, as illustrated below. 


a a 
‘ e 


s ¢ s 


Dy one <e 
Now remove the edge e from T, but leave the two endpoints of e. This 
leaves two smaller trees that we call T; and T . Let’s say T; has x 
vertices and T2 has y vertices. As each of these two smaller trees has 
fewer than k+1 vertices, our inductive hypothesis guarantees that T, 
has x—1 edges, and Tz has y—1 edges. Think about our original tree T. 
It has x+y vertices. It has x—1 edges that belong to T; and y—1 edges 
that belong to JT, plus it has the additional edge e that belongs to 
neither T; nor Tz. Thus, all together, the number of edges that T has is 
(x-1)+(y-1)+1=(x+y)-1. In other words, T has one fewer edges than 
it has vertices. Thus it has (k+1)—1=k edges. 


It follows by strong induction that a tree with n vertices has n—ledges. @ 


Notice that it was absolutely essential that we used strong induction 
in the above proof because the two trees T; and Ty», will not both have k 
vertices. At least one will have fewer than k vertices. Thus the statement 
S; is not enough to imply S;.;. We need to use the assumption that S,, 
will be true whenever m <k, and strong induction allows us to do this. 
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10.2 Proof by Smallest Counterexample 


This section introduces yet another proof technique, called proof by small- 
est counterexample. It is a hybrid of induction and proof by contradiction. 
It has the nice feature that it leads you straight to a contradiction. It 
is therefore more “automatic” than the proof by contradiction that was 
introduced in Chapter 6. Here is the outline: 


Outline for Proof by Smallest Counterexample 
Proposition The statements S1,S2,S3,S4,... are all true. 


Proof. (Smallest counterexample) 

(1) Check that the first statement Sj is true. 

(2) For the sake of contradiction, suppose not every S,, is true. 

(3) Let k>1 be the smallest integer for which S; is false. 

(4) Then S;_; is true and S; is false. Use this to get a contradiction. 


Notice that this setup leads you to a point where S;,; is true and 
S; is false. It is here, where true and false collide, that you will find a 
contradiction. Let’s look at an example. 


Proposition If neéeN, then 4|(5”—-1). 


Proof. We use proof by smallest counterexample. (We will number the 
steps to match the outline, but that is not usually done in practice.) 
(1) If n=1, then the statement is 4|(5!- 1), or 4|4, which is true. 


(2) For sake of contradiction, suppose it’s not true that 4|(5”-1) for all n. 
(3) Let k>1 be the smallest integer for which 4{ (5” — 1). 
(4) Then 4|(5*-!-1), so there is an integer a for which 5’~!—-1=4a. Then: 


5e-1_1 = da 
5(5*-1_-1) = 5-4a 
5*°-5 = 20a 
5*_-1 = 20a+4 
5*-1 = A(5a+1) 


This means 4|(5*—1), a contradiction, because 4} (5*—1) in Step 3. Thus, 
we were wrong in Step 2 to assume that it is untrue that 4|(5”-—1) for 
every n. Therefore 4|(5”-—1) is true for every n. | 
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Our next example proves the fundamental theorem of arithmetic, 
which says any integer greater than 1 has a unique prime factorization. For 
example, 12 factors into primes as 12 = 2-2-3, and moreover any factorization 
of 12 into primes uses exactly the primes 2, 2 and 3. Our proof combines 
the techniques of induction, cases, minimum counterexample and the idea 
of uniqueness of existence outlined at the end of Section 7.3. 


Proposition Any integer n> 1 has a unique prime factorization. That is, 
if n = p1-p2:p3°:: pr and n=a -a9-a3::-a¢ are two factorizations of n into 
primes, then k = @, and the primes p; and a; are the same, except that 
they may be in a different order. 


Proof. Suppose n> 1 is an integer. We first use strong induction to show 
that n has a prime factorization. For the basis step, if n = 2, it is prime, 
so it is already its own prime factorization. Let n > 2 and assume every 
integer between 2 and n (inclusive) has a prime factorization. Consider 
n+1. If n+1is prime, then it is its own prime factorization. If n+1 is 
not prime, then it factors as n+1=ab with a,b>1. Because a and 6 are 
both less than n+1 they have prime factorizations a = p1-p2-p3-::pp and 
b =p: Po°P3°**p,y. Then 


is a prime factorization of n+1. This competes the proof by strong induction 
that every integer greater than 1 has a prime factorization. 

Next we use proof by smallest counterexample to prove that the prime 
factorization of any n= 2 is unique. If n = 2, then it clearly has only one 
prime factorization, namely itself. Assume for the sake of contradiction that 
there is an n that has two different prime factorizations n = p1-p2:p3::-Pr 
and n =a 1-d9:a3:::ag. From n= pj: po: p3:::pr, we see that p;|n, so 
P1|(a1,:a2-a3::-a¢). By the proposition at the bottom of page 158, pj, 
divides one of the primes a;. As a; is prime, we have p; =a;. Dividing 
N= Pi: P2*P3***Pk = 41:a2-a3°--a¢ by pi yields 


P2°*P3°**Pk = G1°A2°A3°*-Aj-1°Aji41°*- ae. 


These two factorizations are different, because the two prime factorizations 
of n were different. (Remember: the primes p; and a; are equal, so the 
difference appears in the remaining factors, displayed above.) But also the 
above number po: p3:::pz is smaller than n, and this contradicts the fact 
that n was the smallest number with two different prime factorizations. 
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One word of warning about proof by smallest counterexample. In proofs 
in other textbooks or in mathematical papers, it often happens that the 
writer doesn’t tell you up front that proof by smallest counterexample 
is being used. Instead, you will have to read through the proof to glean 
from context that this technique is being used. In fact, the same warning 
applies to all of our proof techniques. If you continue with mathematics, 
you will gradually gain through experience the ability to analyze a proof 
and understand exactly what approach is being used when it is not stated 
explicitly. Frustrations await you, but do not be discouraged by them. 
Frustration is a natural part of anything that’s worth doing. 


10.3 Fibonacci Numbers 


Leonardo Pisano, now known as Fibonacci, was an Italian mathematician 
born around 1175. His most significant work was a book Liber Abaci, 
which is recognized as a catalyst in medieval Europe’s slow transition 
from Roman numbers to the Hindu-Arabic number system. But he is best 
known today for a number sequence that he described in his book and 
that bears his name. The Fibonacci sequence is 


1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144, 233, 377,... 


The numbers that appear in this sequence are called Fibonacci numbers. 
The first two numbers are 1 and 1, and thereafter any entry is the sum of 
the previous two entries. For example 3+5 = 8, and 5+8 = 18, etc. We denote 
the n‘” term of this sequence as F,. Thus Fy =1,F2 =1,F3 = 2,F4=3,F7 = 13 
and so on. Notice that the Fibonacci Sequence is entirely determined by 
the rules Fj =1,F9=1, and F, =Fy-1+Fp-2. 

We introduce Fibonacci’s sequence here partly because it is something 
everyone should know about, but also because it is a great source of induc- 
tion problems. This sequence, which appears with surprising frequency in 
nature, is filled with mysterious patterns and hidden structures. Some of 
these structures will be revealed to you in the examples and exercises. 

We emphasize that the condition F,, = F,_1+F,_2 (or equivalently F,,.1 = 
F,+F,-1) is the perfect setup for induction. It suggests that we can 
determine something about F,, by looking at earlier terms of the sequence. 
In using induction to prove something about the Fibonacci sequence, you 
should expect to use the equation F,, = F,_1+Fn-2 somewhere. 

For our first example we will prove that F?, ,-Fn+if,-F? =(-1)" for 
any natural number n. For example, if n =5 we have F? - F¢Fs — F? = 
8?- 8-5-5? = 64-40-25 = -1=(-1)°. 
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Proposition The Fibonacci sequence obeys F?, ,—Fn+iFn—F? =(-1)". 


Proof. We will prove this with mathematical induction. 
(1) Ifn=1 we have F?, ,—Fniif)-F? = F2-F2F,-F? = 1?-1-1-1%=-1= 
(-1)' = (-1)”, so indeed F?, , — FniiFn — F? =(-1)" is true when n=1. 


(2) Take any integer k => 1. We must show that if F?, ,-FriiF.-F7 =(-1)*, 


then F?,, — Fpi2Fp+1-Fe,, =(-1**!. We use direct proof. Suppose 


Fe, —FriiF, —F? =(-1)*. Now we are going to carefully work out the 


expression ke 49 7 n+ oF +1 -F? ,, and show that it really does equal 
(-1)**!. In so doing, we will use the fact that Fy49=Fr41+Fp. 


Fig Frv2Favi-Feyy = (Pesit Fa)? - Peart Fe Fev —Fes 
= F? ,+2FpiiFpt+ Fp -F?,,-FeFpui-F?,, 
= —F? + FriikF,+F? 
= (Fe, —FriiF,-F;) 
ae a i (inductive hypothesis) 
= (-1)(-1* 
= (-1)*t! 


Therefore F?, ,—Fr+2Fr+1—-F},, = (21)"* 4, 
It follows by induction that F?, ,-Fn+ifn-F?=(-1)" foreveryneN. & 


Let’s pause for a moment and think about what the result we just 
proved means. Dividing both sides of F?, , -Fn+iF,-F? =(-1)" by F? gives 


(Fa) Fy+1 i= (-1)” 
Fy, Fy F2 

For large values of n, the right-hand side is very close to zero, and the 
left-hand side is F,,;/F, plugged into the polynomial x?—x-1. Thus, as 
n increases, the ratio F,41/F, approaches a root of x7-x—1=0. By the 


quadratic formula, the roots of x?—x-—1 are 1V6 | As Fy+1/Fy > 1, this ratio 


must be approaching the positive root 1+ V5 Therefore 


(10.1) 


For a quick spot check, note that F'13/F2 ~ 1.618025, while 1iv6 = 1.618033. 
Even for the small value n = 12, the numbers match to four decimal places. 
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The number @ = 1+v6 is sometimes called the golden ratio, and there 
has been much speculation about its occurrence in nature as well as 
in classical art and architecture. One theory holds that the Parthenon 
and the Great Pyramids of Egypt were designed in accordance with this 
number. 

But we are here concerned with things that can be proved. We close by 
observing how the Fibonacci sequence in many ways resembles a geometric 
sequence. Recall that a geometric sequence with first term a and 
common ratio r has the form 


a, ar, ar’, ar, ar’, ar, ar®, ar’, G3 


where any term is obtained by multiplying the previous term by r. In 
general its n’” term is G, =ar”, and Gni/G, =r. Equation (10.1) tells 
us that F,+1/F, ~ ®. Thus even though it is not a geometric sequence, 
the Fibonacci sequence tends to behave like a geometric sequence with 
common ratio ®, and the further “out” you go, the higher the resemblance. 


Exercises for Chapter 10 


Prove the following statements with either induction, strong induction or proof 
by smallest counterexample. 


no+n 


2 

g_ n(nt+1)(2n +1) 
iter eget 

3 n*(n +1)? 
aamey Tatar 


1. For every integer neN, it follows that 1+2+3+4+---+n= 
2. For every integer ne€N, it follows that 17+27+37+47+---4+n 


3. For every integer n€N, it follows that 1°+2°+37+4°+-.-4+n 


n(n+1)(n+ 2) 


4, IfneN, then 1-24+2-3+3-4+4-5+---+n(nt+1)= 3 


5. If neN, then 91492493 4...49n -gntl_o 


n 
6. For every natural number 2, it follows that )(8i—5) = 4n?—n. 
i=l 


12 

7. If neN, then L:B+2-443-544-64--+nin+2) = ED 
12 3 i 1 
See Ad ee ig ek 
PNP gE BU ple heal (n+ D! 


9. For any integer n= 0, it follows that 24|(52” — 1). 
10. For any integer n= 0, it follows that 3|(52” — 1). 
11. For any integer n =0, it follows that 3| (n°? +5n +6). 
12. For any integer n= 0, it follows that 9 | (42” +8). 
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13. 
14. 


15. 


16. 
17. 


18. 


19. 
20. 
21. 


22. 
23. 


24, 


25. 
26. 


27. 
28. 


29. 


30. 


31. 


32. 


For any integer n= 0, it follows that 6|(n?—7). 
Suppose ae Z. Prove that 5|2”a implies 5|a for any neN. 


1 1 1 1 1 1 

If neN, then — + — + — + —~+ +--+ ——— =1-—.. 

1-2 2-3 3-4 4-5 n(n +1) n+1 
For every natural number n, it follows that 2”+1< 3”. 
Suppose A1,A2,...An are sets in some universal set U, and n=2. Prove that 
AynAgn:::NAy, = AyUAgU::UA,p. 
Suppose A1,A2,...An are sets in some universal set U, and n=2. Prove that 
A,UAgU::-UAn = A1nAgn:::NAn. 


1 1iit1 1 1 
Prove that —+—+—=+:---+ 5 #s2--. 
1 4 9 n2 n 
Prove that (1+2+3+---+n)? =19+2°+3°+---+n3 for every neEN. 
Cease Gees eel Oe | 1 1 
If neN, then —+—+-+—+ +--+ poet, 
12 3 4 5 2?-1 Qn 


(Note: This problem asserts that the sum of the first n? terms of the harmonic 
series is at least 1+n/2. It thus implies that the harmonic series diverges.) 
If neN, then (2 ale le ale a} ls] a 

, 2 A 8 16 ar} 4 gntl’ 
Use mathematical induction to prove the binomial theorem (Theorem 3.1 on 
page 78). You may find that you need Equation (3.2) on page 76. 


n 
Prove that )° k(%)=n2”""' for each natural number n. 
k=l 


Concerning the Fibonacci sequence, prove that F,+Fo+F3+F4+...+F,=Fps9-1. 
n 
Concerning the Fibonacci sequence, prove that )° FB; =F,Fys1- 
k=1 
Concerning the Fibonacci sequence, prove that F,+F3+F5+F7+...+Fon_1= Fon. 


Concerning the Fibonacci sequence, prove that Fo+F4+Fe+Fgt+...+ Fon = 
Fon+1-1. 


In this problem né€N and F, is the n‘” Fibonacci number. Prove that 


(5) + ("12") + ("27) + (75°) +--+ + (2) = Fst. 
(For example, (6) + (?)+ ($)+ (3) + (2) + (4) + (9) = 145+6+14+0+0+0 = 13 = F641.) 
Here F,, is the n*” Fibonacci number. Prove that 
(234) - (454) 


F, = 
V5 


n 
Prove that )° (*)=("t}), where 1<r<n. 
h=0 


Prove that the number of n-digit binary numbers that have no consecutive 
1’s is the Fibonacci number F,,,29. For example, for n =2 there are three such 


Fibonacci Numbers 169 


33. 


34. 


35. 
36. 


numbers (00, 01, and 10), and 3=Fo.,.9=F 4. Also, for n =8 there are five such 

numbers (000, 001, 010, 100, 101), and 5=F3,9=Fs. 

Suppose n (infinitely long) straight lines lie on a plane in such a way that no 

two of the lines are parallel, and no three of the lines intersect at a single 
E 7 wate 2 nz4+n+2 7 

point. Show that this arrangement divides the plane into "~j** regions. 


_ grtl_3 


Prove that 3'+37+3°+34+---+3” for every neN. 


If n,keN, and n is even and k is odd, then (7) is even. 


If n =2* —-1 for some k€N, then every entry in the nth row of Pascal’s triangle 
is odd. 


Part IV 


Relations, Functions and 
Cardinality 


CHAPTER 11 


Relations 


n mathematics there are endless ways that two entities can be related 
to each other. Consider the following mathematical statements. 


5<10 5<5 


6 = 20 5 | 80 7>4 x#y 843 
a=b(modn) 6EZ x 


a 
CY 72314 OF-1 V2¢€Z ZEN 


In each case two entities appear on either side of a symbol, and we 
interpret the symbol as expressing some relationship between the two 
entities. Symbols such as <,<,=,|,{,>,>, € and c, etc., are called relations 
because they convey relationships among things. 

Relations are significant. In fact, you would have to admit that there 
would be precious little left of mathematics if we took away all the relations. 
Therefore it is important to have a firm understanding of relations, and 
this chapter is intended to develop that understanding. 

Rather than focusing on each relation individually (an impossible task 
anyway since there are infinitely many different relations), we will develop 
a general theory that encompasses all relations. Understanding this 
general theory will give us the conceptual framework and language needed 
to understand and discuss any specific relation. 

Before stating the theoretical definition of a relation, let’s look at a 
motivational example. This example will lead naturally to our definition. 

Consider the set A = {1,2,3,4,5}. (There’s nothing special about this 
particular set; any set of numbers would do for this example.) Elements of 
A can be compared to each other by the symbol “<.” For example, 1 <4, 
2<38, 2<4, and so on. You have no trouble understanding this because the 
notion of numeric order is so ingrained. But imagine you had to explain 
it to an idiot savant, one with an obsession for detail but absolutely no 
understanding of the meaning of (or relationships between) integers. You 
might consider writing down for your student the following set: 


R = {(1,2),(1,3), (1,4), (1,5), (2,3), (2,4), (2,5), (3, 4), (3, 5), (4,5) f. 
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The set R encodes the meaning of the < relation for elements in A. An 
ordered pair (a,b) appears in the set if and only if a<b. If asked whether 
or not it is true that 3<4, your student could look through R until he 
found the ordered pair (3,4); then he would know 3<4 is true. If asked 
about 5 <2, he would see that (5,2) does not appear in R, so 5<¢2. The set 
R, which is a subset of A x A, completely describes the relation < for A. 

Though it may seem simple-minded at first, this is exactly the idea 
we will use for our main definition. This definition is general enough to 
describe not just the relation < for the set A = {1,2,3,4,5}, but any relation 
for any set A. 


Definition 11.1 A relation on a set A is a subset RC Ax A. We often 
abbreviate the statement (x,y)¢ R as xRy. The statement (x,y) ¢R is 
abbreviated as xR y. 


Notice that a relation is a set, so we can use what we know about sets 
to understand and explore relations. But before getting deeper into the 
theory of relations, let’s look at some examples of Definition 11.1. 


Example 11.1 Let A = {1,2,3,4}, and consider the following set: 
R= {(1, 1), (2, 1), (2, 2), (3, 3), (3, 2), (3, 1), (4, 4), (4, 3), (4, 2), (4, 1)} CAXxA. 


The set R is a relation on A, by Definition 11.1. Since (1,1)¢ R, we have 
1R1. Similarly 2R1 and 2R2, and so on. However, notice that (for example) 
(3,4)¢R, so 3R4. Observe that R is the familiar relation = for the set A. 

Chapter 1 proclaimed that all of mathematics can be described with 
sets. Just look at how successful this program has been! The greater-than 
relation is now a set R. (We might even express this in the rather cryptic 
form == R.) 


Example 11.2 Let A= {1,2,3,4}, and consider the following set: 
S= {(1, 1), (1,3), (8, 1), (8, 3), (2, 2), (2,4),(4,2),(4,4)} CAXxA. 


Here we have 191, 183, 4S2, etc., but 384 and 2$1. What does S mean? 
Think of it as meaning “has the same parity as.” Thus 181 reads “1 has 
the same parity as 1,” and 4S2 reads “4 has the same parity as 2.” 


Example 11.3 Consider relations R and S of the previous two examples. 
Note that RNS = {(1, 1), (2, 2), (3, 3), (3, 1), (4,4), (4, 2)} ¢ AxA is a relation on A. 
The expression x(RNS)y means “x= y, and x,y have the same parity.” 


175 


Example 11.4 Let B= {0,1,2,3,4,5}, and consider the following set: 
U= {(1,3), (3,3), (5, 2), (2, 5), (4, 2)} CBxB. 


Then U is a relation on B because UC BxB. You may be hard-pressed 
to invent any “meaning” for this particular relation. A relation does not 
have to have any meaning. Any random subset of B x B is a relation on B, 
whether or not it describes anything familiar. 


Some relations can be described with pictures. For example, we can 
depict the above relation U on B by drawing points labeled by elements of 
B. The statement (x, y)¢U is then represented by an arrow pointing from 
x to y, a graphic symbol meaning “x relates to y.” Here’s a picture of U: 


0 i} 2 
3 4 5 


The next picture illustrates the relation R on the set A = {a,b,c,d}, where 
xRy means x comes before y in the alphabet. According to Definition 11.1, 
as a set this relation is R = {(a,b),(a,c),(a,d),(b,c),(b,d),(c,d)}. You may 
feel that the picture conveys the relation better than the set does. They 
are two different ways of expressing the same thing. In some instances 
pictures are more convenient than sets for discussing relations. 


ve 


b Cc 


Although such diagrams can help us visualize relations, they do have 
their limitations. If A and R were infinite, then the diagram would be 
impossible to draw, but the set R might be easily expressed in set-builder 
notation. Here are some examples. 


Example 11.5 Consider the set R = {(x,y)e Zx Z:x—-yeN}CZxZ. This 
is the > relation on the set A = Z. It is infinite because there are infinitely 
many ways to have x > y where x and y are integers. 


Example 11.6 The set R = {(x,x):x¢R}CR*R is the relation = on the 
set R, because xRy means the same thing as x= y. Thus R is a set that 
expresses the notion of equality of real numbers. 
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Exercises for Section 11.0 


1. 


2. 


11. 


Let A = {0,1,2,3,4,5}. Write out the relation R that expresses > on A. Then 
illustrate it with a diagram. 

Let A = {1,2,3,4,5,6}. Write out the relation R that expresses | (divides) on A. 
Then illustrate it with a diagram. 


. Let A = {0,1,2,3,4,5}. Write out the relation R that expresses = on A. Then 


illustrate it with a diagram. 


. Here is a diagram for a relation R on a set A. Write the sets A and R. 


. Here is a diagram for a relation R on a set A. Write the sets A and R. 


0 1 2 


3 4 5 


. Congruence modulo 5 is a relation on the set A = Z. In this relation xR y means 


x=y (mod 5). Write out the set R in set-builder notation. 


. Write the relation < on the set A=Z as a subset R of Z~x Z. This is an infinite 


set, so you will have to use set-builder notation. 


. Let A = {1,2,3,4,5,6}. Observe that @¢ Ax A, so R=@ is a relation on A. Draw 


a diagram for this relation. 


. Let A = {1,2,3,4,5,6}. How many different relations are there on the set A? 
. Consider the subset R =(R x R)— {(x,x):x€R}CRxR. What familiar relation on 


R is this? Explain. 
Given a finite set A, how many different relations are there on A? 


In the following exercises, subsets R of R?=RxR or Z*=ZxZ are indicated by 


gray shading. In each case, R is a familiar relation on R or Z. State it. 
eoo0°0 @oeaoao 


12. 


13. 
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11.1 Properties of Relations 


A relational expression xRy is a statement (or an open sentence); it is either 
true or false. For example, 5< 10 is true, and 10<5 is false. (Thus an 
operation like + is not a relation, because, for instance, 5+10 has a numeric 
value, not a T/F value.) Since relational expressions have T/F values, we 
can combine them with logical operators; for example, xRy => yRx is a 
statement or open sentence whose truth or falsity may depend on x and y. 

With this in mind, note that some relations have properties that others 
don’t have. For example, the relation < on Z satisfies x < x for every x€é Z. 
But this is not so for < because x <x is never true. The next definition 
lays out three particularly significant properties that relations may have. 


Definition 11.2 Suppose R is a relation on a set A. 


1. Relation R is reflexive if xRx for every x¢€ A. 
That is, R is reflexive if VxeA,xRx. 


2. Relation R is symmetric if xRy implies yRx for all x,yeEA 
That is, R is symmetric if Vx,yeA,xRy > yRx. 

3. Relation R is transitive if whenever xRy and yRz, then also xRz. 
That is, R is transitive if Vx,y,z€A,((xRy)A(yRz)) > xRz. 


To illustrate this, let’s consider the set A = Z. Examples of reflexive 
relations on Z include <, =, and |, because x< x, x =x and x|x are all true 
for any x€Z. On the other hand, >, <, 4 and { are not reflexive for none 
of the statements x<x, x>x, x«#x and x{x is true. 

The relation # is symmetric, for if x4 y, then surely y#«x also. Also, 
the relation = is symmetric because x = y always implies y= x. 

The relation < is not symmetric, as x < y does not necessarily imply 
y <x. For instance 5 <6 is true, but 6<5 is false. Notice (x < y) > (y <x) 
is true for some x and y (for example, it is true when x= 2 and y= 2), but 
still < is not symmetric because it is not the case that (x < y) > (y <x) is 
true for all integers x and y. 

The relation < is transitive because whenever x<y and y<z, it also 
is true that x<z. Likewise <,=,> and = are all transitive. Examine the 
following table and be sure you understand why it is labeled as it is. 


Relations on Z: | < < = | of # 

Reflexive no yes yes yeS no no 
Symmetric no no yes no no_ yes 
Transitive yes yes yeS yes no _ no 
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Example 11.7 Here A = {b,c,d,e}, and R is the following relation on A: 
R= {(b,b),(b,c),(c, b),(c,c), (d,d),(b, d),(d,b), (c,d), (d,c)}. 


This relation is not reflexive, for although bRb, cRc and dRd, it is not 
true that eRe. For a relation to be reflexive, xRx must be true for all xe A. 
The relation R is symmetric, because whenever we have xRy, it follows 
that yRx too. Observe that bRc and cRb; bRd and dRb; dRc and cRd. 
Take away the ordered pair (c,b) from R, and R is no longer symmetric. 

The relation R is transitive, but it takes some work to check it. We 
must check that the statement (xRy A yRz)=> <xRz is true for all x,y,zEA. 
For example, taking x = b, y=c and z=d, we have (bRcAcRd) => bRd, 
which is the true statement (TA 7T)=>T. Likewise, (b6Rd AdRc)= bRc is 
the true statement (TAT)=>T. Take note that if x=b, y=e and z=c, 
then (bRe AeRc) => bRc becomes (F AF) => T, which is still true. It’s not 
much fun, but going through all the combinations, you can verify that 
(xRy \yRz)=>xRz is true for all choices x,y,z¢ A. (Try at least a few of 
them.) 

The relation R from Example 11.7 has a meaning. You can think of 
xRy as meaning that x and y are both consonants. Thus bRc because b 
and ¢ are both consonants; but bRe because it’s not true that b and e are 
both consonants. Once we look at it this way, it’s immediately clear that R 
has to be transitive. If x and y are both consonants and y and z are both 
consonants, then surely x and z are both consonants. This illustrates a 
point that we will see again later in this section: Knowing the meaning of 
a relation can help us understand it and prove things about it. 

Here is a picture of R. Notice that we can immediately spot several 
properties of R that may not have been so clear from its set description. 
For instance, we see that R is not reflexive because it lacks a loop at e, 
hence eRe. 


o, 


CR *D 


Figure 11.1. The relation R from Example 11.7 


Properties of Relations 179 


In what follows, we summarize how to spot the various properties of a 
relation from its diagram. Compare these with Figure 11.1. 


A relation is Pisoni 

1.) reflexive if ox a Cx 
: loop at x: 

for each point x ... 


A relation is : 
fais ... there is also 
symmetric if ayaa 
Ww 
2. | whenever there isan Xe >e/y xec Sey 
y back to x: 
arrow from x to y ... 


A relation is 
ivancitivear y ...there is also 


y 
whenever there are an arrow from 
x to z: 
arrows from x to y if Z x 2 
and y to z ... 
(If x =z, this means 4 
that if there are ] 


arrows from x to y 
and from y tox... 


... there is also 
a loop from 
x back to x.) 


Reg ee 


Consider the bottom diagram in Box 3, above. The transitive property 
demands (xRyA yRx)=>xRx. Thus, if xRy and yRx in a transitive relation, 
then also xRx, so there is a loop at x. In this case (yRxAxRy)=> yRy, so 
there will be a loop at y too. 


Although these visual aids can be illuminating, their use is limited be- 
cause many relations are too large and complex to be adequately described 
as diagrams. For example, it would be impossible to draw a diagram 
for the relation = (mod n), where ne N. Such a relation would best be 
explained in a more theoretical (and less visual) way. 

We next prove that = (mod n) is reflexive, symmetric and transitive. 
Obviously we will not glean this from a drawing. Instead we will prove it 
from the properties of = (mod n) and Definition 11.2. Pay attention to this 
example. It illustrates how to prove things about relations. 
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Example 11.8 Prove the following proposition. 


Proposition Let neéeN. The relation = (mod n) on the set Z is reflexive, 
symmetric and transitive. 


Proof. First we will show that = (mod n) is reflexive. Take any integer x € Z, 
and observe that n|0, so n|(x—x). By definition of congruence modulo n, 
we have x =x (mod n). This shows x =x (mod n) for every x € Z, so = (mod n) 
is reflexive. 

Next, we will show that = (mod n) is symmetric. For this, we must show 
that for all x,y eZ, the condition x = y (mod n) implies that y =x (mod n). 
We use direct proof. Suppose x = y (mod n). Thus n|(x-—y) by definition 
of congruence modulo n. Then x- y=na for some aé€ Z by definition of 
divisibility. Multiplying both sides by -1 gives y-x=n(-a). Therefore 
n|(y—x), and this means y=x (mod n). We’ve shown that x= y (mod n) 
implies that y =x (mod n), and this means = (mod n) is symmetric. 

Finally we will show that = (mod n) is transitive. For this we must 
show that if x = y (mod n) and y=z (mod n), then x =z (mod n). Again 
we use direct proof. Suppose x = y (mod n) and y=z (mod n). This 
means n|(x—y) and n|(y—z). Therefore there are integers a and b for 
which x—y=na and y—z=nb. Adding these two equations, we obtain 
x-z=na+nb. Consequently, x—z=n(a+b), so n|(x—z), hence x =z (mod n). 
This completes the proof that = (mod n) is transitive. 

The past three paragraphs have shown that = (mod n) is reflexive, 
symmetric and transitive, so the proof is complete. a 


As you continue your mathematical education, you will find that the 
reflexive, symmetric and transitive properties take on special significance 
in a variety of settings. In preparation for this, the next section explores 
further consequences of these relations. But first you should work some of 
the following exercises. 


Exercises for Section 11.1 


1. Consider the relation R = {(a,a),(b,b),(c,c),(d,d),(a,b),(b,a)} on set A = {a,b,c,d}. 
Is R reflexive? Symmetric? Transitive? If a property does not hold, say why. 

2. Consider the relation R = {(a,b),(a,c),(c,c),(b, b),(c, b),(b, c)} on the set A = {a,b,c}. 
Is R reflexive? Symmetric? Transitive? If a property does not hold, say why. 

3. Consider the relation R = {(a,b),(a,c),(c,b),(b,c)} on the set A= {a,b,c}. Is R 
reflexive? Symmetric? Transitive? If a property does not hold, say why. 
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4. 


Let A = {a,b,c,d}. Suppose R is the relation 


R = {(a,a),(b,b),(c,c),(d,d),(a,b),(b,a),(a,c),(c,a), 
(a,d),(d,a),(b,c),(c, b),(b,d),(d, b), (c,d), (d,c)}. 


Is R reflexive? Symmetric? Transitive? If a property does not hold, say why. 


. Consider the relation R = {(0,0),(V2,0),(0, V2),(V2, V2)} on R. Is R reflexive? 


Symmetric? Transitive? If a property does not hold, say why. 


. Consider the relation R = {(x,x):x¢€Z} on Z. Is R reflexive? Symmetric? 


Transitive? If a property does not hold, say why. What familiar relation is 
this? 


. There are 16 possible different relations R on the set A = {a,b}. Describe all of 


them. (A picture for each one will suffice, but don’t forget to label the nodes.) 
Which ones are reflexive? Symmetric? Transitive? 


. Define a relation on Z as xRy if |x—y| <1. Is R reflexive? Symmetric? Transitive? 


If a property does not hold, say why. What familiar relation is this? 


. Define a relation on Z by declaring xRy if and only if x and y have the same 


parity. Is R reflexive? Symmetric? Transitive? If a property does not hold, say 
why. What familiar relation is this? 


. Suppose AZ#@. Since GC Ax A, the set R=@ is a relation on A. Is R reflexive? 


Symmetric? Transitive? If a property does not hold, say why. 


. Suppose A = {a,b,c,d} and R = {(a,a),(b,b),(c,c),(d,d)}. Is R reflexive? Symmet- 


ric? Transitive? If a property does not hold, say why. 


. Prove that the relation | (divides) on the set Z is reflexive and transitive. (Use 


Example 11.8 as a guide if you are unsure of how to proceed.) 


. Consider the relation R = {(x,y)eRxR:x-—yeZ} on R. Prove that this relation 


is reflexive, symmetric and transitive. 


. Suppose R is a symmetric and transitive relation on a set A, and there is an 


element ace A for which aRx for every xe A. Prove that R is reflexive. 


. Prove or disprove: If a relation is symmetric and transitive, then it is also 


reflexive. 


. Define a relation R on Z by declaring that xRy if and only if x? = y? (mod 4). 


Prove that R is reflexive, symmetric and transitive. 


. Modifying the above Exercise 8 (above) slightly, define a relation ~ on Z asx~y 


if and only if |x—y| <1. Say whether ~ is reflexive. Is it symmetric? Transitive? 


. The table on page 177 shows that relations on Z may obey various combinations 


of the reflexive, symmetric and transitive properties. In all, there are 2° = 
8 possible combinations, and the table shows 5 of them. (There is some 
redundancy, as < and | have the same type.) Complete the table by finding 
examples of relations on Z for the three missing combinations. 
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11.2 Equivalence Relations 


The relation = on the set Z (or on any set A) is reflexive, symmetric 
and transitive. There are many other relations that are also reflexive, 
symmetric and transitive. Relations that have all three of these properties 
occur very frequently in mathematics and often play quite significant roles. 
(For instance, this is certainly true of the relation =.) Such relations are 
given a special name. They are called equivalence relations. 


Definition 11.3 A relation R on a set A is an equivalence relation if 
it is reflexive, symmetric and transitive. 


As an example, Figure 11.2 shows four different equivalence relations 
R1,R2,R3 and R4 on the set A = {-1,1,2,3,4}. Each one has its own meaning, 
as labeled. For example, in the second row the relation R2 literally means 
“has the same parity as.” So 1R23 means “1 has the same parity as 3,” etc. 


Relation R Diagram Equivalence classes 
(see next page) 
“is equal to” (=) BP MEO Gay alt 


Ri =((-1,-D,(, D, 2, 2), (8,3),(4,4)} 


{3}, 14} 


“has same parity as” 
R2 = {(-1,-), (1, D, 2, 2), (3,3), (4,4), 
(-1, 1),(1,-1),(-1, 3), (8, -1), 
(1,3),(8, 1), (2,4), (4, 2)} 


{-1,1,3}, {2,4} 


“has same sign as” 

Rg = {(-1,-1), (1, 1), (2, 2), (8, 3),(4,4), 
(1,2), (2, 1), (1,3), (3, 1), (1, 4), (4, 1), 
(2,3), (3,2), (2,4), (4,2), (1,3),(3, I} 


{-1}, {1,2,3,4} 


“has same parity and sign as” i) 


R4= {(-1,-D, (1, 1), (2, 2), (3, 3), (4, 4), 


@) 
i {-1f, {1,3}, {2,4} 
(1,3),(3, D, (2,4), (4,2)} (1) 


Figure 11.2. Examples of equivalence relations on the set A = {—1,1,2,3,4} 
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The above diagrams make it easy to check that each relation is reflexive, 
symmetric and transitive, i.e., that each is an equivalence relation. For 
example, R; is symmetric because xR;y > yR,x is always true: When x= y 
it becomes T => T (true), and when x # y it becomes F => F (also true). In 
a similar fashion, R; is transitive because (xR1y A yR1z)>xR,z is always 
true: It always works out to one of T>T, F>T or F=>F. (Check this.) 

As you can see from the examples in Figure 11.2, equivalence relations 
on a set tend to express some measure of “sameness” among the elements 
of the set, whether it is true equality or something weaker (like having 
the same parity). 

It’s time to introduce an important definition. Whenever you have 
an equivalence relation R on a set A, it divides A into subsets called 
equivalence classes. Here is the definition: 


Definition 11.4 Suppose R is an equivalence relation on a set A. Given 
any element ac A, the equivalence class containing a is the subset 
{x€A:xRa} of A consisting of all the elements of A that relate to a. This 
set is denoted as [a]. Thus the equivalence class containing a is the set 
[a]={xeA:xRa}. 


Example 11.9 Consider the relation R,; in Figure 11.2. The equivalence 
class containing 2 is the set [2] ={x¢A:xR,2}. Since in this relation 2 
relates to itself and nothing else, we have [2] = {2}. Other equivalence 
classes for R; are [-1]={-1}, [1]={1}, [3]= {3} and [4]={4}. Thus this 
relation has five separate equivalence classes. 


Example 11.10 Consider the relation R2 in Figure 11.2. The equivalence 
class containing 2 is the set [2] ={x¢ A:xR 22}. Since 2 relates only to itself 
and 4, we have [2]= {2,4}. Observe that we also have [4]= {xe A:xR24}= 
{2,4', so [2]=[4]. Another equivalence class for R2 is [1]={x¢ A:xRe1} 
= {-1,1,3}. In addition, note that [1] = [-1] =[3]={-1,1,3}. Thus this 
relation has just two equivalence classes, namely {2,4} and {-1,1,3}. 


Example 11.11 The relation R, in Figure 11.2 has three equivalence 
classes. They are [-1]={-1} and [1] =[3]= {1,3} and [2] =[4] = {2,4}. 


Don’t be misled by Figure 11.2. It’s important to realize that not 
every equivalence relation can be drawn as a diagram involving nodes 
and arrows. Even the simple relation R = {(x,x):x€R}, which expresses 
equality in the set R, is too big to be drawn. Its picture would involve a 
point for every real number and a loop at each point. Clearly that’s too 
many points and loops to draw. 
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We close this section with several other examples of equivalence rela- 
tions on infinite sets. 


Example 11.12 Let P be the set of all polynomials with real coefficients. 
Define a relation R on P as follows. Given f(x), g(x)€P, let f(x)R g(x) mean 
that f(x) and g(x) have the same degree. Thus (x? +3x-4)R(3x?-2) and 
(x? + 3x? — 4) R (3x? —2), for example. It takes just a quick mental check to 
see that R is an equivalence relation. (Do it.) It’s easy to describe the 
equivalence classes of R. For example, [3x?+2] is the set of all polynomials 
that have the same degree as 3x? +2, that is, the set of all polynomials of 
degree 2. We can write this as [3x7 + 2] = {ax?+bx+c:a,b,ceR,a #0}. 


Example 11.8 proved that for a given néN the relation = (mod n) is 
reflexive, symmetric and transitive. Thus, in our new parlance, = (mod n) 
is an equivalence relation on Z. Consider the case n= 3. Let’s find the 
equivalence classes of the equivalence relation = (mod 8). The equivalence 
class containing 0 seems like a reasonable place to start. Observe that 


[0] = {x € Z:x=0(mod 3)} = 
{xeZ:8|(e—O} = {xe Z:813} ={...,-8,0,8,69,..-} 


Thus the class [0] consists of all the multiples of 3. (Or, said differently, 
[0] consists of all integers that have a remainder of 0 when divided by 8). 
Note that [0] = [3] = [6] =[9], etc. The number 1 does not show up in the 
set [0] so let’s next look at the equivalence class [1]: 


[1] = {xe Z:x=1(mod 3)} = {xe Z:3|(x—-D} ={...,-5,—-2,1,4,7,10,...}. 


The equivalence class [1] consists of all integers that give a remainder of 
1 when divided by 3. The number 2 is in neither of the sets [0] or [1], so 
we next look at the equivalence class [2]: 


[2] = {ae Z:x=2(mod 3)} = {xe Z:3|(x—2)} = {...,-4,-1,2,5,8, 11,...}. 


The equivalence class [2] consists of all integers that give a remainder of 
2 when divided by 3. Observe that any integer is in one of the sets [0], [1] 
or [2], so we have listed all of the equivalence classes. Thus = (mod 8) has 
exactly three equivalence classes, as described above. 

Similarly, you can show that the equivalence relation = (mod n) has n 
equivalence classes [0],[1],[2],..., [2-1]. 
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Exercises for Section 11.2 


10. 


11. 


12. 


13. 


14. 


15. 


. Let A= {1,2,3,4,5,6}, and consider the following equivalence relation on A: 


R = {(1, 1), (2, 2), (3,3), (4,4), (5,5), (6, 6), (2,3), (3, 2), (4,5), (5, 4), (4, 6), (6, 4), (5,6), (6, 5)} 
List the equivalence classes of R. 


. Let A= {a,b,c,d,e}. Suppose R is an equivalence relation on A. Suppose R has 


two equivalence classes. Also aRd, bRc and eRd. Write out R as a set. 


. Let A= {a,b,c,d,e}. Suppose R is an equivalence relation on A. Suppose R has 


three equivalence classes. Also aRd and bRc. Write out R as a set. 


. Let A= {a,b,c,d,e}. Suppose R is an equivalence relation on A. Suppose also 


that aRd and bRc, eRa and cRe. How many equivalence classes does R have? 


. There are two different equivalence relations on the set A = {a,b}. Describe 


them. Diagrams will suffice. 


. There are five different equivalence relations on the set A = {a,b,c}. Describe 


them all. Diagrams will suffice. 


. Define a relation R on Z as xRy if and only if 3x—5y is even. Prove R is an 


equivalence relation. Describe its equivalence classes. 


. Define a relation R on Z as xRy if and only if x?+y? is even. Prove R is an 


equivalence relation. Describe its equivalence classes. 


. Define a relation R on Z as xR y if and only if 4|(x+3y). Prove R is an equivalence 


relation. Describe its equivalence classes. 


Suppose R and S are two equivalence relations on a set A. Prove that RnS 
is also an equivalence relation. (For an example of this, look at Figure 11.2. 
Observe that for the equivalence relations R2,R3 and R4, we have Ren R3 = R4.) 


Prove or disprove: If R is an equivalence relation on an infinite set A, then R 
has infinitely many equivalence classes. 


Prove or disprove: If R and S are two equivalence relations on a set A, then 
RUS is also an equivalence relation on A. 


Suppose R is an equivalence relation on a finite set A, and every equivalence 
class has the same cardinality m. Find a formula for |R| in terms of m. 


Suppose R is a reflexive and symmetric relation on a finite set A. Define 
a relation S on A by declaring xSy if and only if for some neé€N there are 
elements x1,%9,...,x,¢€A satisfying xRx 1, x1Rx2, x9Rx3, x3Rx4,...,X,-1Rxn, and 
x,Ry. Show that S is an equivalence relation and R¢S. Prove that S is the 
unique smallest equivalence relation on A containing R. 


Suppose R is an equivalence relation on a set A, with four equivalence classes. 
How many different equivalence relations S on A are there for which R<S? 
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11.3 Equivalence Classes and Partitions 


This section collects several properties of equivalence classes. 

Our first result proves that [a]=[b] if and only if aRb. This is useful 
because it assures us that whenever we are in a situation where [a] = [bd], 
we also have aRb, and vice versa. Being able to switch back and forth 
between these two pieces of information can be helpful in a variety of 
situations, and you may find yourself using this result a lot. Be sure to 
notice that the proof uses all three properties (reflexive, symmetric and 
transitive) of equivalence relations. Notice also that we have to use some 
Chapter 8 techniques in dealing with the sets [a] and [bd]. 


Theorem 11.1 Suppose R is an equivalence relation on a set A. Suppose 
also that a,b€ A. Then [a] =[b] if and only if aRb. 


Proof. Suppose [a] =[b]. Note that aRa by the reflexive property of R, so 
aeé{xe€A:xRa}=[a]=[b]= {xe A:xRb}. But a belonging to {xe A:xRb} 
means aRb. This completes the first part of the if-and-only-if proof. 

Conversely, suppose aRb. We need to show [a] =[b]. We will do this by 
showing [a] <[b] and [b]¢ [al]. 

First we show [a]<[b]. Suppose ce [a]. As ce [a]= {xe A:xRa}, we get 
cRa. Now we have cRa and aRb, so cRb because R is transitive. But cRb 
implies ce {x¢ A:xRb}=[b]. This demonstrates that c € [a] implies c€ [b], 
so [a] <[b]. 

Next we show [b]¢l[a]. Suppose ce[b]. As ce[b]= {xe A:xRb}, we get 
cRb. Remember that we are assuming aRb, so bRa because R is symmetric. 
Now we have cRb and bRa, so cRa because R is transitive. But cRa implies 
ce {x¢€A:xRa}=[a]. This demonstrates that c€[b] implies c€ [a]; hence 
[b] < [a]. 

The previous two paragraphs imply that [a] = [bd]. a 


To illustrate Theorem 11.1, recall how we worked out the equivalence 
classes of = (mod 3) at the end of Section 11.2. We observed that 


[-3] =[9] = {...,-3,0,3,6,9,...}. 


Note that [—3]=[9] and —3=9 (mod 3), just as Theorem 11.1 predicts. The 
theorem assures us that this will work for any equivalence relation. In the 
future you may find yourself using the result of Theorem 11.1 often. Over 
time it may become natural and familiar; you will use it automatically, 
without even thinking of it as a theorem. 
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Our next topic addresses the fact that an equivalence relation on a set 
A divides A into various equivalence classes. There is a special word for 
this kind of situation. We address it now, as you are likely to encounter it 
in subsequent mathematics classes. 


Definition 11.5 A partition of a set A is a set of non-empty subsets of 
A, such that the union of all the subsets equals A, and the intersection of 
any two different subsets is @. 


Example 11.13 Let A = {a,b,c,d}. One partition of A is {{a,b}, {c}, {d}}. 
This is a set of three subsets {a,b}, {c} and {d} of A. The union of the 
three subsets equals A; the intersection of any two subsets is @. 

Other partitions of A are 


Hab} ied}, tach to}. tat}, tap tos tchat}, — t1a,b,e, dj}, 


to name a few. Intuitively, a partition is just a dividing up of A into pieces. 


Example 11.14 Consider the equivalence relations in Figure 11.2. Each 
of these is a relation on the set A = {-1,1,2,3,4}. The equivalence classes 
of each relation are listed on the right side of the figure. Observe that, 
in each case, the set of equivalence classes forms a partition of A. For 
example, the relation R; yields the partition {{-1}, {1}, {2}, {3}, {43} of A. 
Likewise the equivalence classes of Re form the partition {{—1,1,3}, {2,43}. 


Example 11.15 Recall that we worked out the equivalence classes of the 
equivalence relation = (mod 3) on the set Z. These equivalence classes 
give the following partition of Z: 


10 25, 38,0)3, 6595.4.) 4 92,14, 710 fj, Sh 


We can write it more compactly as {[0],[1],[2]}. 


Our examples and experience suggest that the equivalence classes of 
an equivalence relation on a set form a partition of that set. This is indeed 
the case, and we now prove it. 


Theorem 11.2 Suppose R is an equivalence relation on a set A. Then 
the set {[a]:a¢A} of equivalence classes of R forms a partition of A. 


Proof. To show that {[a]:a¢ A} is a partition of A we need to show two 
things: We need to show that the union of all the sets [a] equals A, and 
we need to show that if [a] 4[b], then [a]n[b]= @. 
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Notationally, the union of all the sets [a] is Ugc ala], so we need to prove 
Uaeala]l= A. Suppose x€ Ugcala]. This means x€ [a] for some ace A. Since 
[a] cA, it then follows that xe A. Thus Ugeala] ¢ A. On the other hand, 
suppose x¢€ A. As x€[x], we know x€ [a] for some ac A (namely a =x). 
Therefore x € Ugcala], and this shows A CUgcala]. Since Ugeala] GA and 
AScUnealal, it follows that Ugeala] =A. 

Next we need to show that if [a] 4[b] then [a]n[b]= 9. Let’s use 
contrapositive proof. Suppose it’s not the case that [a]n[b] =, so there is 
some element c with c€[a]n[b]. Thus ce [a] and ce [b]. Now, cé€ [a] means 
cRa, and then aRc since R is symmetric. Also ce [b] means cRb. Now 
we have aRc and cRbd, so aRb (because R is transitive). By Theorem 11.1, 
aRb implies [a] =[b]. Thus [a] 4[b] is not true. 

We’ve now shown that the union of all the equivalence classes is A, 
and the intersection of two different equivalence classes is ¢. Therefore 
the set of equivalence classes is a partition of A. a 


Theorem 11.2 says the equivalence classes of any equivalence relation 
on a set A form a partition of A. Conversely, any partition of A describes 
an equivalence relation R where xRy if and only if x and y belong to the 
same set in the partition. (See Exercise 4 for this section, below.) Thus 
equivalence relations and partitions are really just two different ways of 
looking at the same thing. In your future mathematical studies, you may 
find yourself easily switching between these two points of veiw. 


Exercises for Section 11.3 


1. List all the partitions of the set A = {a,b}. Compare your answer to the answer 
to Exercise 5 of Section 11.2. 

2. List all the partitions of the set A = {a,b,c}. Compare your answer to the 
answer to Exercise 6 of Section 11.2. 

3. Describe the partition of Z resulting from the equivalence relation = (mod 4). 

4, Suppose P is a partition of a set A. Define a relation R on A by declaring xR y 
if and only if x,y¢ X for some X € P. Prove R is an equivalence relation on A. 
Then prove that P is the set of equivalence classes of R. 

5. Consider the partition P = {{...,—4,-2,0,2,4,...},{...,-5,-3,-1,1,3,5,...}} of Z. 
Let R be the equivalence relation whose equivalence classes are the two ele- 
ments of P. What familiar equivalence relation is R? 
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11.4 The Integers Modulo n 


Example 11.8 proved that for a given neéN, the relation = (mod n) is 
reflexive, symmetric and transitive, so it is an equivalence relation. This 
is a particularly significant equivalence relation in mathematics, and in 


the present section we deduce some of its properties. 

To make matters simpler, let’s pick a concrete n, say n=5. Let’s begin 
by looking at the equivalence classes of the relation = (mod 5). There are 
five equivalence classes, as follows: 


[0] = {xeZ:x=0(mod5))} = {xeZ:5|(x-0)} = {...,-10,-5,0,5,10,15,...}, 
[1] = {xeZ:x=1(mod5))} = {xeZ:5|/(x-D} = {..., -9,-4,1,6,11,16,...}, 
[2] = {xeZ:x=2(mod5))} = {xeZ:5|(x-2)} = {..., -8,-3,2,7,12,17,...}, 
[3] = {xeZ:x=3 (mod 5))} = {xeZ:5|(x-3)} = {..., -7,-2,3,8,13,18,...}, 
[4] = {xeZ:x=4(mod5))} = {xeZ:5|(x-4} = {..., -6,-1,4,9,14,19,...}. 


Notice how these equivalence classes form a partition of the set Z. 
We label the five equivalence classes as [0],[1],[21,[3] and [4], but you 
know of course that there are other ways to label them. For example, 
[0] = [5] = [10] = [15], and so on; and [1] = [6] = [-4], etc. Still, for this 
discussion we denote the five classes as [0],[1],[2],[3] and [4]. 

These five classes form a set, which we shall denote as Z5. Thus 


Zs = {(0],(11,121,[3],[4]} 


is a set of five sets. The interesting thing about Zs is that even though its 
elements are sets (and not numbers), it is possible to add and multiply 
them. In fact, we can define the following rules that tell how elements of 
Zs can be added and multiplied. 


[a]+[b] la +b] 
[a]-[6] = [a-b] 


For example, [2]+[1] =[2+1]=[8], and [2]-[2] =[2-2]=[4]. We stress 
that in doing this we are adding and multiplying sets (more precisely 
equivalence classes), not numbers. We added (or multiplied) two elements 
of Z; and obtained another element of Z;. 

Here is a trickier example. Observe that [2]+[3]=[5]. This time we 
added elements [2],[3]€ Z5, and got the element [5]¢ Z5. That was easy, 
except where is our answer [5] in the set Z5 = {[0],[1],[2],[3],[4]}? Since 
[5]=[0], it is more appropriate to write [2]+[3] = [0]. 
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In a similar vein, [2]-[8]=[6] would be written as [2]-[3]=[1] because 
[6]=[1]. Test your skill with this by verifying the following addition and 
multiplication tables for Z5. 


+ | f0] [1] [2] [3] [4] - | [0] (1) [2] [3] [4] 


10] | (0) (1) [2] [3] [4] [0] ; [0] [0] [0] [0] [0] 
(11); 11] [2] [3] [4] [0] [1] | [0] (1) [2] [3] [4] 
[2] | [2] [38] [4] [0] [4] [2] | [0] [2] [4] [1] [3] 
[3] | [3] [4] [0] [0 [2] [3] |} [0] [38] [1 [4] [2] 
[4] | [4] [0] [1] [2] [3] [4] | [0] [4] [3] 2) 


We call the set Zs = {[0],[1],[2],[3],[4]} the integers modulo 5. As our 
tables suggest, Zs; is more than just a set: It is a little number system 
with its own addition and multiplication. In this way it is like the familiar 
set Z which also comes equipped with an addition and a multiplication. 

Of course, there is nothing special about the number 5. We can also 
define Z, for any natural number n. Here is the definition: 


Definition 11.6 Let n—eN. The equivalence classes of the equivalence 
relation = (mod n) are [0],[1],[2],...,[.—1]. The integers modulo n is the 
set Z, = {[0],[1],[2],...,[2—1]}. Elements of Z, can be added by the rule 
[a]+[b]=[a+b] and multiplied by the rule [a]-[b] = [ab]. 


Given a natural number n, the set Z, is a number system containing n 
elements. It has many of the algebraic properties that Z,R and Q possess. 
For example, it is probably obvious to you already that elements of Z,, 
obey the commutative laws [a]+[b] =[b]+[a] and [a]-[b]=[b]-[a]. You can 
also verify that the distributive law [a]-([b]+[c]) =[a]-[b]+[a]-[c] holds, as 
follows: 


la]-(lb]+le]) = [a]-[b+c] 
= [a(b+c)] 
= [ab+ac] 
= [ab]+lac] 
= [a]-[b]+[a]-[el. 


The integers modulo n are significant because they more closely fit certain 
applications than do other number systems such as Z or R. If you go on to 
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take a course in abstract algebra, then you will work extensively with Z,, 
as well as other, more exotic, number systems. (In such a course you will 
also use all of the proof techniques that we have discussed, as well as the 
ideas of equivalence relations.) 

To close this section we take up an issue that may have bothered 
you earlier. It has to do with our definitions of addition [a]+[b] = [a+b] 
and multiplication [a]-[b]=[ab]. These definitions define addition and 
multiplication of equivalence classes in terms of representatives a and b 
in the equivalence classes. Since there are many different ways to choose 
such representatives, we may well wonder if addition and multiplication 
are consistently defined. For example, suppose two people, Alice and Bob, 
want to multiply the elements [2] and [8] in Z;. Alice does the calculation 
as [2]-[8] = [6] =[1], so her final answer is [1]. Bob does it differently. 
Since [2] =[7] and [38] =[8], he works out [2]-[3] as [7]-[8] =[56]. Since 56=1 
(mod 5), Bob’s answer is [56] = [1], and that agrees with Alice’s answer. Will 
their answers always agree or did they just get lucky (with the arithmetic)? 

The fact is that no matter how they do the multiplication in Z,, their 
answers will agree. To see why, suppose Alice and Bob want to multiply 
the elements [a],[b]¢ Z,, and suppose [a] = [a’] and [b] =[b’]. Alice and Bob 
do the multiplication as follows: 


Alice: [a]-[b] =[ab], 
Bob: [a’]-[b'] = [a’b’]. 


We need to show that their answers agree, that is we need to show 
[ab] =[a’b']. Since [a] =[a'], we know by Theorem 11.1 that a=a' (mod n). 
Thus n|(a-a’), so a—a’'=nk for some integer k. Likewise, as [b] =[b'], we 
know b=)! (mod n), or n|(b—b’), so b—b'=né for some integer @. Thus 
we get a=a'+nk and b=b'+né. Therefore: 


ab = (a'+nk)\(b'+né) 
ab = a’b'+a'nl+nkb'+n7ke 
ab—a'b’ n(a'l+kb' +nk?). 


This shows n|(ab—a'b'), so ab=a’b’ (mod n), and from that we conclude 
[ab] =[a’b']. Consequently Alice and Bob really do get the same answer, so 
we can be assured that the definition of multiplication in Z, is consistent. 

Exercise 8 below asks you to show that addition in Z, is similarly 
consistent. 
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Exercises for Section 11.4 


. Write the addition and multiplication tables for Z2. 
. Write the addition and multiplication tables for Z3. 
. Write the addition and multiplication tables for Z4. 
. Write the addition and multiplication tables for Zg. 


a FP & NH = 


. Suppose [a],[b]e Z5 and [a]-[b]=[0]. Is it necessarily true that either [a] = [0] 
or [b] = [0]? 
6. Suppose [a],[b]€ Zg and [a]-[b]=[0]. Is it necessarily true that either [a] = [0] 
or [b] = [0]? 
7. Do the following calculations in Z9, in each case expressing your answer as [a] 
with 0<a<8. 
(a) [8]+[8] (b) [24]+[11] (ce) [21]-[15] (d) [8]-[8] 
8. Suppose [a],[b] « Z,, and [a] =[a’] and [b] = [b’]. Alice adds [a] and [b] as 
[a]+[b]=[a+b]. Bob adds them as [a’]+[b’] =[a’+b’]. Show that their answers 
[a+b] and [a'+b’] are the same. 


11.5 Relations Between Sets 


In the beginning of this chapter, we defined a relation on a set A to 
be a subset RC Ax A. This created a framework that could model any 
situation in which elements of A are compared to themselves. In this 
setting, the statement xRy has elements x and y from A on either side 
of the R because R compares elements from A. But there are other 
relational symbols that don’t work this way. Consider ¢. The statement 
5€ Z expresses a relationship between 5 and Z (namely that the element 5 
is in the set Z) but 5 and Z are not in any way naturally regarded as both 
elements of some set A. To overcome this difficulty, we generalize the idea 
of a relation on A to a relation from A to B. 


Definition 11.7 A relation from a set A to a set B is asubset RC AxB. 
We often abbreviate the statement (x,y)« R asxRy. The statement (x,y)¢R 
is abbreviated as xf y. 


Example 11.16 Suppose A = {1,2} and B= YA(A) = {@, {1}, {2}, {1,2}}. Then 
R = {(1, {1}), (2, {2$), (1, {1, 2}), (2, {1,2})} AB is a relation from A to B. Note 
that we have 1R{1}, 2R{2}, 1R{1,2} and 2R{1,2}. The relation R is the 
familiar relation € for the set A, that is, xR X means exactly the same 
thing as xe X. 
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Diagrams for relations from A to B differ from diagrams for relations 
on A. Since there are two sets A and B in a relation from A to B, we have 
to draw labeled nodes for each of the two sets. Then we draw arrows from x 
to y whenever xRy. The following figure illustrates this for Example 11.16. 


Figure 11.3. A relation from A to B 


The ideas from this chapter show that any relation (whether it is a 
familiar one like =, <, =, |, € or ¢, or a more exotic one) is really just a set. 
Therefore the theory of relations is a part of set theory. In the next chapter, 
we will see that this idea touches on another important mathematical 
construction, namely functions. We will define a function to be a special 
kind of relation from one set to another, and in this context we will see 
that any function is really just a set. 


CHAPTER 12 


Functions 


YY" know from calculus that functions play a fundamental role in math- 
ematics. You likely view a function as a kind of formula that describes 
a relationship between two (or more) quantities. You certainly understand 
and appreciate the fact that relationships between quantities are impor- 
tant in all scientific disciplines, so you do not need to be convinced that 
functions are important. Still, you may not be aware of the full significance 
of functions. Functions are more than merely descriptions of numeric 
relationships. In a more general sense, functions can compare and relate 
different kinds of mathematical structures. You will see this as your 
understanding of mathematics deepens. In preparation of this deepening, 
we will now explore a more general and versatile view of functions. 

The concept of a relation between sets (Definition 11.7) plays a big role 
here, so you may want to quickly review it. 


12.1 Functions 


Let’s start on familiar ground. Consider the function f(x) = x? from R to R. 
Its graph is the set of points R = {(x,x”):xeR} CRxR. 


Figure 12.1. A familiar function 


Having read Chapter 11, you may see f in a new light. Its graph 
RCRxR is a relation on the set R. In fact, as we shall see, functions 
are just special kinds of relations. Before stating the exact definition, we 
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look at another example. Consider the function f(n) = |n|+2 that converts 
integers n into natural numbers |n|+2. Its graph is R = {(n,|n|+2):ne Z} 
Cc ZxN. 


Figure 12.2. The function f :Z—N, where f(n) = |n|+2 


Figure 12.2 shows the graph R as darkened dots in the grid of points Z x N. 
Notice that in this example R is not a relation on a single set. The set of 
input values Z is different from the set N of output values, so the graph 
R¢ZxN isa relation from Z to N. 

This example illustrates three things. First, a function can be viewed 
as sending elements from one set A to another set B. (In the case of f, 
A=Z and B=N.) Second, such a function can be regarded as a relation 
from A to B. Third, for every input value n, there is exactly one output 
value f(n). In your high school algebra course, this was expressed by the 
vertical line test: Any vertical line intersects a function’s graph at most 
once. It means that for any input value x, the graph contains exactly one 
point of form (x, f(x)). Our main definition, given below, incorporates all of 
these ideas. 


Definition 12.1 Suppose A and B are sets. A function f from A to B 
(denoted as f:A— 8B) is a relation f¢AxB from A to B, satisfying the 
property that for each a< A the relation f contains exactly one ordered 
pair of form (a,b). The statement (a,b)¢ f is abbreviated f(a) = b. 


Example 12.1 Consider the function f graphed in Figure 12.2. According 
to Definition 12.1, we regard f as the set of points in its graph, that is, 
f ={(m,|n|+2):neZ} ¢ ZxN. This is a relation from Z to N, and indeed 
given any a€ Z the set f contains exactly one ordered pair (a,|a|+2) whose 
first coordinate is a. Since (1,3)€ f, we write f(1) =3; and since (-3,5)ef 
we write f(—3)=5, etc. In general, (a,b) ¢ f means that f sends the input 
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value a to the output value b, and we express this as f(a) = b. This function 
can be expressed by a formula: For each input value n, the output value 
is |n|+2, so we may write f(n)=|n|+2. All this agrees with the way we 
thought of functions in algebra and calculus; the only difference is that 
now we also think of a function as a relation. 


Definition 12.2 For a function f:A— B, the set A is called the domain 
of f. (Think of the domain as the set of possible “input values” for f.) The 
set B is called the codomain of f. The range of f is the set {f(a):a¢ A} = 
{b :(a,b)e f}. (Think of the range as the set of all possible “output values” 
for f. Think of the codomain as a sort of “target” for the outputs.) 


Continuing Example 12.1, the domain of f is Z and its codomain is 
N. Its range is {f(a):a€ Z}= {lal+2:ae€Z} = {2,3,4,5,...}. Notice that the 
range is a subset of the codomain, but it does not (in this case) equal the 
codomain. 

In our examples so far, the domains and codomains are sets of numbers, 
but this needn’t be the case in general, as the next example indicates. 


Example 12.2. Let A= {p,q,r,s} and B= {0,1,23, and 
f = {(p,0), (9, D,(r, 2),(s,2)} CA xB. 


This is a function f:A— B because each element of A occurs exactly once 
as a first coordinate of an ordered pair in f. We have f(p)=0, f(q)=1, 
f(r)=2 and f(s)=2. The domain of f is {p,q,r,s}, and the codomain and 
range are both {0,1,2}. 


B gn ns A B 
'(p,0)\(q,0) (7,0) (s,0) 


N s 


(Dg. Dir) (sD) ess 4 
i Btn ne de - Pe x 
(p,2) (q,2)\(r,2) (8,2)! We WN 


-~. ei 


(a) (b) 


Figure 12.3. Two ways of drawing the function f = {(p,0),(q, 1),(r,2),(s, 2)} 
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If A and B are not both sets of numbers it can be difficult to draw 
a graph of f: A—B in the traditional sense. Figure 12.3(a) shows an 
attempt at a graph of f from Example 12.2. The sets A and B are aligned 
roughly as x- and y-axes, and the Cartesian product A xB is filled in 
accordingly. The subset f < A x B is indicated with dashed lines, and this 
can be regarded as a “graph” of f. A more natural visual description of f 
is shown in 12.3(b). The sets A and B are drawn side-by-side, and arrows 
point from a to b whenever f(a) = 6. 

In general, if f : A — B is the kind of function you may have encountered 
in algebra or calculus, then conventional graphing techniques offer the 
best visual description of it. On the other hand, if A and B are finite or if 
we are thinking of them as generic sets, then describing f with arrows is 
often a more appropriate way of visualizing it. 

We emphasize that, according to Definition 12.1, a function is really 
just a special kind of set. Any function f:A— B is a subset of Ax B. By 
contrast, your calculus text probably defined a function as a certain kind of 
“rule.” While that intuitive outlook is adequate for the first few semesters 
of calculus, it does not hold up well to the rigorous mathematical standards 
necessary for further progress. The problem is that words like “rule” are 
too nebulous. Phrasing the definition of a function in the language of sets 
removes the ambiguity. 

Still, in practice we tend to think of functions as rules. Given f:Z—N 
where f(x) = |x|+2, we think of this as a rule that associates any number 
neéZ to the number |n|+2 in N, rather than a set containing ordered 
pairs (n,|n|+2). It is only when we have to understand or interpret the 
theoretical nature of functions (as we do in this text) that Definition 12.1 
comes to bear. The definition is a foundation that gives us license to think 
about functions in a more informal way. 

The next example brings up a point about notation. Consider a function 
such as f :Z* — Z, whose domain is a Cartesian product. This function 
takes as input an ordered pair (m,n) € Z? and sends it to a number f((m,n)) € 
Z. To simplify the notation, it is common to write f(m,n) instead of f((m,n)), 
even though this is like writing fx instead of f(x). We also remark that 
although we’ve been using the letters f,g and h to denote functions, any 
other reasonable symbol could be used. Greek letters such as g and @ are 
common. 


Example 12.3 Say a function y: Z? — Z is defined as y(m,n) = 6m—-9n. 
Note that as a set, this function is g = {((m,n),6m—9n) : (m,n)€ Z7} CZ? xZ. 
What is the range of ~? 
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To answer this, first observe that for any (m,n) € Z?, the value f(m,n) = 
6m—9n = 3(2m-—3n) is a multiple of 3. Thus every number in the range is 
a multiple of 3, so the range is a subset of the set of all multiples of 3. On 
the other hand if b = 3k is a multiple of 3 we have y(-k, —k) = 6(—k)-9(-k) = 
3k = b, which means any multiple of 3 is in the range of y. Therefore the 
range of ¢ is the set {3k:k € Z} of all multiples of 3. 


To conclude this section, let’s use Definition 12.1 to help us understand 
what it means for two functions f:A—B and g:C—D to be equal. 
According to our definition, functions f and g are subsets fc AxB and 
g<CxD. If the two functions are to be equal, we require that the following 
sets be equal: f=g, A=C, and B=D. 

Suppose for example, that A = {1,2,3} and B= {a,b}. The two func- 
tions f = {(1,a),(2,a),(3,6)} and g = {(3,b),(2,a),(1,a)} from A to B are equal 
because the sets f and g are equal. Observe that the equality f = g 
means f/f (x)= g(x) for every xe A. We repackage this idea in the following 
definition. 


Definition 12.3 Two functions f:A— Band g:C —D are equal if A=C, 
B=D and f(x) = g(x) for every xe A. 


According to the definition, to show functions f and g are equal, we just 
need to confirm that their domains are equal, their codomains are equal 
and f(x) = g(x) for every x in the domain. There is a shade of meaning to 
watch out for here. Consider the functions f:Z—N and g:Z— Z defined 
as f(x) =|x|+2 and g(x)=|x|+2. Even though their domains are the same 
and f(x) = g(x) for every x in the domain, they are technically not equal 
because their codomains differ. 


Exercises for Section 12.1 


1. Suppose A = {0,1,2,3,4}, B = {2,3,4,5} and f = {(0,3),(1,3), (2,4), (3, 2),(4,2)}. State 
the domain and range of f. Find f(2) and f(1). 

2. Suppose A = {a,b,c,d}, B = {2,3,4,5,6} and f = {(a,2),(b,3),(c,4),(d,5)}. State the 
domain and range of f. Find f(b) and f(d). 

3. There are four different functions f : {a,b} — {0,1}. List them all. Diagrams 
will suffice. 

4. There are eight different functions f : {a,b,c} — {0,1}. List them all. Diagrams 
will suffice. 

5. Give an example of a relation from {a,b,c,d} to {d,e} that is not a function. 
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6. Suppose f : Z — Z is defined as f = {(x,4x+5):x eZ}. State the domain, codomain 
and range of f. Find f(10). 

7. Consider the set f = {(x,y)€Zx Z:3x+y=4}. Is this a function from Z to Z? 
Explain. 

8. Consider the set f = {(x,y)€ Zx Z:x+3y=4}. Is this a function from Z to Z? 
Explain. 

9. Consider the set f = {(x?,x):x eR}. Is this a function from R to R? Explain. 
10. Consider the set f = {(x?,x):x€R}. Is this a function from R to R? Explain. 
11. Is the set 6 = {(X,|X|):X <Z5} a function? If so, what is its domain and range? 
12. Is the set 0 = {((x, y),(3y,2x,x+ y)):x,yeR} a function? If so, what is its domain, 

codomain and range? 


12.2 Injective and Surjective Functions 


You may recall from algebra and calculus that a function may be one- 
to-one and onto, and these properties are related to whether or not the 
function is invertible. We now review these important ideas. In advanced 
mathematics, the word injective is often used instead of one-to-one, and 
surjective is used instead of onto. Here are the exact definitions: 


Definition 12.4 A function f:A-—B is: 

1. injective (or one-to-one) if for every x,ye¢ A, x4 y implies f(x) Z f(y); 
2. surjective (or onto) if for every b< B there is anaeéA with f(a)=); 
3. bijective if f is both injective and surjective. 


Below is a visual description of Definition 12.4. In essence, injective 
means that unequal elements in A always get sent to unequal elements in 
B. Surjective means that every element of B has an arrow pointing to it, 
that is, it equals f(a) for some a in the domain of f. 


Injective means that for any 


two x,y¢A, this happens... and not this: 


Surjective means that for 
any beB... 


...this happens: 
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For more concrete examples, consider the following functions from R 
to R. The function f(x) =x? is not injective because —2 4 2, but f(—2) = f(2). 
Nor is it surjective, for if b = -1 (or if b is any negative number), then 
there is no a€ R with f(a)=b. On the other hand, g(x) = x? is both injective 
and surjective, so it is also bijective. 


There are four possible injective/surjective combinations that a function 
may possess. This is illustrated in the following figure showing four 
functions from A to B. Functions in the first column are injective, those 
in the second column are not injective. Functions in the first row are 
surjective, those in the second row are not. 


Injective Not injective 


Surjective 


Not surjective 


We note in passing that, according to the definitions, a function is 
surjective if and only if its codomain equals its range. 

Often it is necessary to prove that a particular function f:A— B 
is injective. For this we must prove that for any two elements x,y«A, 
the conditional statement (x #4 y) > (f(x) 4 f(y)) is true. The two main 
approaches for this are summarized below. 


How to show a function / : A — B is injective: 


Direct approach: Contrapositive approach: 
Suppose x,yeA and xy. Suppose x,ye€A and f(x) = f(y). 
Therefore f(x) 4 f(y). Therefore x = y. 


Of these two approaches, the contrapositive is often the easiest to use, 
especially if f is defined by an algebraic formula. This is because the 
contrapositive approach starts with the equation f(x) = f(y) and proceeds 
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to the equation x=. In algebra, as you know, it is usually easier to work 
with equations than inequalities. 

To prove that a function is not injective, you must disprove the statement 
(x 4 y)=> (f(x) # f(y). For this it suffices to find example of two elements 
x,y€A for which x4 y and f(x)= f(y). 

Next we examine how to prove that f:A—B is surjective. According 
to Definition 12.4, we must prove the statement Vbe B,dacA,f(a)=b. In 
words, we must show that for any b€ B, there is at least one ae A (which 
may depend on b) having the property that f(a)=b. Here is an outline. 


How to show a function / : A — B is surjective: 


Suppose b¢€ B. 
[Prove there exists a¢ A for which f(a) =).] 


In the second step, we have to prove the existence of an a for which 
f(a)=b. For this, just finding an example of such an a would suffice. (How 
to find such an example depends on how f is defined. If f is given as a 
formula, we may be able to find a by solving the equation f(a) = 6 for a. 
Sometimes you can find a by just plain common sense.) To show f is not 
surjective, we must prove the negation of Vbe B,ja€ A,f(a)=b, that is, 
we must prove 4b€B,VacA,f(a)Z#b. 

The following examples illustrate these ideas. (For the first example, 
note that the set R- {0} is R with the number 0 removed.) 


Example 12.4 Show that the function f :R— {0} — R defined as f(x) = 4++1 
is injective but not surjective. 

We will use the contrapositive approach to show that f is injective. 
Suppose x,y¢R-{0} and f(x)= f(y). This means : +1= ; +1. Subtracting 
1 from both sides and inverting produces x= y. Therefore f is injective. 

Function f is not surjective because there exists an element b=1—ER 
for which f(x)=+4+1#1 for every x €R- {0}. 


Example 12.5 Show that the function g:ZxZ-— ZxZ defined by the 
formula g(m,n) = (m+n,m+2n), is both injective and surjective. 

We will use the contrapositive approach to show that g is injective. 
Thus we need to show that g(m,n)= g(k,¢) implies (m,n) =(k,¢). Suppose 
(m,n), (k,f) € ZxZ and g(m,n)= gtk, @). Then (m+n,m+2n) = (k+¢,k+20). It 
follows that m+n=k+@ and m+2n=k+2¢. Subtracting the first equation 
from the second gives n=. Next, subtract n= ¢ from m+n=k+ @ to get 
m=k. Since m=k and n=, it follows that (m,n)=(k,¢). Therefore g is 
injective. 
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To see that g is surjective, consider an arbitrary element (b,c) € Z x Z. 
We need to show that there is some (x,y) ¢ Zx Z for which g(x,y) =(b,c). To 
find (x,y), note that g(x,y) =(b,c) means (x+ y,x+2y)=(b,c). This leads to 
the following system of equations: 


x + y = b 
x + 2Qy = 


Solving gives x =2b-c and y=c-—b. Then (x,y) =(2b-—c,c—b). We now 
have g(2b—c,c—6b)=(b,c), and it follows that g is surjective. 


Example 12.6 Consider function h:Z x Z— Q defined as h(m,n) = 


Determine whether this is injective and whether it is surjective. 

This function is not injective because of the unequal elements (1,2) and 
(1,-2) in Zx Z for which A(1,2) = h(1, -2) = $. However, h is surjective: Take 
any element b€ Q. Then b= § for some c,d € Z. Notice we may assume d is 
positive by making c negative, if necessary. Then h(c,d-—1)= Wout =7=6. 


In|+1° 


Exercises for Section 12.2 


1. Let A= {1,2,3,4} and B= {a,b,c}. Give an example of a function f: A — B that 
is neither injective nor surjective. 

2. Consider the logarithm function In:(0,0oo)— R. Decide whether this function is 
injective and whether it is surjective. 

3. Consider the cosine function cos: R—R. Decide whether this function is injective 
and whether it is surjective. What if it had been defined as cos:R— [-—1,1]? 

4, A function f :Z— Z~xZ is defined as f(n) = (2n,n+8). Verify whether this 
function is injective and whether it is surjective. 

5. A function f :Z— Z is defined as f(n)=2n+1. Verify whether this function is 
injective and whether it is surjective. 

6. A function f :Zx Z— Z is defined as f(m,n)=3n-—4m. Verify whether this 
function is injective and whether it is surjective. 

7. A function f :Zx Z-— Z is defined as f(m,n)=2n—4m. Verify whether this 
function is injective and whether it is surjective. 

8. A function f :Z x Z— ZxZ is defined as f(m,n)=(m+n,2m-+n). Verify whether 


this function is injective and whether it is surjective. 
5x+1 
9. Prove that the function f :R-{2}— R- {5} defined by f(x) = — is bijective. 


x 

3 

10. Prove the function f :R-{1}— R-{1} defined by f(x) = [=] is bijective. 
= 


11. Consider the function 6: {0,1} x N— Z defined as @(a,b) =(—-1)°b. Is @ injective? 
Is it surjective? Bijective? Explain. 


The Pigeonhole Principle 203 


12. Consider the function 6: {0,1}xN-— Z defined as 0(a,b) = a—2ab+b. Is @ injective? 
Is it surjective? Bijective? Explain. 

13. Consider the function f :R? — R? defined by the formula f(x, y)= (xy,x?). Is f 
injective? Is it surjective? Bijective? Explain. 

14. Consider the function 6: A(Z) — A(Z) defined as 6(X)=X. Is @ injective? Is it 
surjective? Bijective? Explain. 

15. This question concerns functions f :{A,B,C,D,E,F,G} — {1,2,3,4,5,6,7}. How 
many such functions are there? How many of these functions are injective? 
How many are surjective? How many are bijective? 

16. This question concerns functions f :{A,B,C,D,E} — {1,2,3,4,5,6,7$. How many 
such functions are there? How many of these functions are injective? How 
many are surjective? How many are bijective? 

17. This question concerns functions f :{A,B,C,D,E,F,G} — {1,2}. How many such 
functions are there? How many of these functions are injective? How many 
are surjective? How many are bijective? 


18. Prove that the function f :N— Z defined as f(n)= ia o ase is bijective. 


12.3 The Pigeonhole Principle 


Here is a simple but useful idea. Imagine there is a set A of pigeons and 
a set B of pigeon-holes, and all the pigeons fly into the pigeon-holes. You 
can think of this as describing a function f : A — B, where pigeon X flies 
into pigeon-hole f(X). Figure 12.4 illustrates this. 


Pigeons Pigeon-holes Pigeons Pigeon-holes 
f f 
c= = 
(a) (b) 


Figure 12.4. The pigeonhole principle 


In Figure 12.4(a) there are more pigeons than pigeon-holes, and it 
is obvious that in such a case at least two pigeons have to fly into the 
same pigeon-hole, meaning that f is not injective. In Figure 12.4(b) there 
are fewer pigeons than pigeon-holes, so clearly at least one pigeon-hole 
remains empty, meaning that f is not surjective. 
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Although the underlying idea expressed by these figures has little to 
do with pigeons, it is nonetheless called the pigeonhole principle: 
The Pigeonhole Principle 
Suppose A and B are finite sets and f: A —B is any function. Then: 


1. If |A|>|B|, then f is not injective. 
2. If |A| <|B|, then f is not surjective. 


Though the pigeonhole principle is obvious, it can be used to prove 
some things that are not so obvious. 


Example 12.7 Prove the following proposition. 


Proposition If A is any set of 10 integers between 1 and 100, then there 
exist subsets X CA and Y CA for which the sum of elements in X equals 
the sum of elements in Y. 


To illustrate what this proposition is saying, consider the random set 
A= {5, 7,12,11,17,50,51, 80,90, 100} 


of 10 integers between 1 and 100. Notice that A has subsets X = {5,80} and 
Y = {7,11,17,50} for which the sum of the elements in X equals the sum of 
those in Y. If we tried to “mess up” A by changing the 5 to a 6, we get 


A = {6,7,12, 11, 17,50,51,80, 90, 100} 


which has subsets X = {7,12,17,50} and Y = {6,80} both of whose elements 
add up to the same number (86). The proposition asserts that this is always 
possible, no matter what A is. Here is a proof: 


Proof. Suppose A € {1,2,3,4,...,99,100} and |A| = 10, as stated. Notice that 
if X <A, then X has no more than 10 elements, each between 1 and 100, 
and therefore the sum of all the elements of X is less than 100-10 = 1000. 
Consider the function 


f : P(A) — {0,1,2,3,4,..., 1000} 


where /(X) is the sum of the elements in X. (Examples: f ({3,7,50}) = 60; 
f ({1,70,80,95}) = 246.) As | A(A)| = 21° = 1024 > 1001 = |{0,1,2,3,...,1000}|, 
it follows from the pigeonhole principle that f is not injective. Therefore 
there are two unequal sets X,Y € A(A) for which f(X)=/f(Y). In other 
words, there are subsets X <A and Y CA for which the sum of elements 
in X equals the sum of elements in Y. a 
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Example 12.8 Prove the following proposition. 


Proposition There are at least two Texans with the same number of 
hairs on their heads. 


Proof. We will use two facts. First, the population of Texas is more than 
twenty million. Second, it is a biological fact that every human head 
has fewer than one million hairs. Let A be the set of all Texans, and 
let B = {0,1,2,3,4,...,1000000}. Let f: A — B be the function for which f(x) 
equals the number of hairs on the head of x. Since |A| >|B|, the pigeonhole 
principle asserts that f is not injective. Thus there are two Texans x and 
y for whom f(x) = f(y), meaning that they have the same number of hairs 
on their heads. a 


Proofs that use the pigeonhole principle tend to be inherently non- 
constructive, in the sense discussed in Section 7.4. For example, the above 
proof does not explicitly give us of two Texans with the same number of 
hairs on their heads; it only shows that two such people exist. If we were 
to make a constructive proof, we could find examples of two bald Texans. 
Then they have the same number of head hairs, namely zero. 


Exercises for Section 12.3 


1. Prove that if six numbers are chosen at random, then at least two of them will 
have the same remainder when divided by 5. 


2. If a is a natural number, then there exist two unequal natural numbers & and 
¢ for which a® —a® is divisible by 10. 

3. Prove that if six natural numbers are chosen at random, then the sum or 
difference of two of them is divisible by 9. 

4, Consider a square whose side-length is one unit. Select any five points from 
inside this square. Prove that at least two of these points are within v2 units 
of each other. 


5. Prove that any set of seven distinct natural numbers contains a pair of numbers 
whose sum or difference is divisible by 10. 


6. Given a sphere S, a great circle of S is the intersection of S with a plane 
through its center. Every great circle divides S into two parts. A hemisphere 
is the union of the great circle and one of these two parts. Prove that if five 
points are placed arbitrarily on S, then there is a hemisphere that contains 
four of them. 
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12.4 Composition 


You should be familiar with the notion of function composition from algebra 
and calculus. Still, it is worthwhile to revisit it now with our more 
sophisticated ideas about functions. 


Definition 12.5 Suppose f:A—B and g:B—C are functions with the 
property that the codomain of f equals the domain of g. The composition 
of f with g is another function, denoted as gof and defined as follows: If 
x€A, then gof (x)= g(f(x)). Therefore gof sends elements of A to elements 
of C, so gof:A—-C. 


The following figure illustrates the definition. Here f:A—B, g:B—C, 
and gof:A—C. We have, for example, gof(0)= g(f(0)) = g(2)=4. Be very 
careful with the order of the symbols. Even though g comes first in the 
symbol gof, we work out gof(x) as g(f(x)), with f acting on x first, followed 
by g acting on f(x). 


Figure 12.5. Composition of two functions 


Notice that the composition gof also makes sense if the range of f 
is a subset of the domain of g. You should take note of this fact, but to 
keep matters simple we will continue to emphasize situations where the 
codomain of f equals the domain of g. 


Example 12.9 Suppose A = {a,b,c}, B= {0,1}, C= {1,2,3}. Let f:A—B 
be the function f = {(a,0),(b,1),(c,0)}, and let g:B—C be the function 
g = {(0,3),(1, )}. Then gof ={(a,3),(b, 1),(c,3)}. 

Example 12.10 Suppose A = {a,b,c}, B= {0,1}, C = {1,2,3}. Let f:A—-B 
be the function f = {(a,0),(b,1),(c,0)}, and let g:C — B be the function 
g = {(1,0),(2,1),(3,1)}. In this situation the composition gof is not defined 
because the codomain B of f is not the same set as the domain C of g. 
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Remember: In order for gof to make sense, the codomain of f must equal 
the domain of g. (Or at least be a subset of it.) 


Example 12.11 Let f:R—R be defined as f(x)=x?+x, and g:R—R be 
defined as g(x)=x+1. Then gof:R—R is the function defined by the 
formula go f (x)= g(f(x))= g(x?+x)=x2%4+x41. 

Since the domains and codomains of g and f are the same, we can in 
this case do a composition in the other order. Note that fog:R—R is the 
function defined as f o g(x) = f(g(x)) = f(t 1) = («+ 1)? +(x+ Dax? +3x42. 

This example illustrates that even when gof and fog are both defined, 
they are not necessarily equal. We can express this fact by saying function 
composition is not commutative. 


We close this section by proving several facts about function composition 
that you are likely to encounter in your future study of mathematics. First, 
we note that, although it is not commutative, function composition is 
associative. 


Theorem 12.1 Composition of functions is associative. That is if f:A— B, 
g:B—C andh:C—D, then (hog)of =ho(gof). 


Proof. Suppose f,g,h are as stated. It follows from Definition 12.5 that 
both (hog)of and ho(gof) are functions from A to D. To show that they 
are equal, we just need to show 


((eog)of)(x) = (ho(gof)}() 
for every x¢ A. Note that Definition 12.5 yields 


((eog)o f(x) = (hog f(a) = hala). 


Also 
[no(gof)\(x) = hlgof (x) = Alef). 
Thus 
(reg) f)(x)=(ho(gof)}(w, 
as both sides equal h(g(f(x))). a 


Theorem 12.2 Suppose f:A—B and g:B—C. If both f and g are 
injective, then gof is injective. If both f and g are surjective, then gof is 
surjective. 
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Proof. First suppose both f and g are injective. To see that gof is injective, 
we must show that gof (x)= gof(y) implies x= y. Suppose gof(x)=gof(y). 
This means g(f(x)) = g(f(y)). It follows that f(x) = f(y). (For otherwise g 
wouldn’t be injective.) But since f(x) = f(y) and f is injective, it must be 
that x= y. Therefore gof is injective. 

Next suppose both f and g are surjective. To see that gof is surjective, 
we must show that for any element ce C, there is a corresponding element 
a€éA for which gof(a)=c. Thus consider an arbitrary ce C. Because g 
is surjective, there is an element b¢€ B for which g(b)=c. And because 
f is surjective, there is an element a¢ A for which f(a)= 06. Therefore 
g(f(a)) = g(b)=c, which means gof(a)=c. Thus gof is surjective. 5 


Exercises for Section 12.4 


1. Suppose A = {5,6,8}, B= {0,1}, C = {1,2,3}. Let f: A —B be the function f = 
{(5,1),(6,0),(8, D}, and g:B—C be g= {(0,),(1, D}. Find gof. 
2. Suppose A = {1,2,3,4}, B= {0,1,2}, C= {1,2,3}. Let f:A—B be 


f = {(1,0), (2, D, (3, 2), (4, O)}, 


and g:B—C be g= {(0,1),(1,1),(2,3)}. Find gof. 

3. Suppose A = {1,2,3}. Let f:4—A be the function f = {(1,2),(2,2),(3,1)}, and let 
g:A—A be the function g = {(1,3),(2,1),(8,2)}. Find gof and fog. 

4. Suppose A = {a,b,c}. Let f: A —A be the function f = {(a,c),(b,c),(c,c)}, and let 
g:A—A be the function g = {(a,a),(b,b),(c,a)}. Find gof and fog. 

5. Consider the functions f,g:R— R defined as f(x) = Vx+1 and g(x)=x°. Find 
the formulas for gof and fog. 

6. Consider the functions f,g:R— R defined as f(x) = = and g(x) =3x+2. Find 
the formulas for gof and fog. 

7. Consider the functions f,g:Zx Z— ZxZ defined as f(m,n) = (mn,m?) and 
g(m,n)=(m+1,m+n). Find the formulas for gof and fog. 

8. Consider the functions f,g:ZxZ-— Z~xZ defined as f(m,n) =(8m—4n,2m+n) 
and g(m,n)=(5m+n,m). Find the formulas for gof and fog. 

9. Consider the functions f :Z x Z— Z defined as f(m,n)=m+n and g:Z—-ZxZ 
defined as g(m)=(m,m). Find the formulas for gof and fog. 

10. Consider the function f :R? — R? defined by the formula f(x,y) = (xy,x?). Find 

a formula for fof. 
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12.5 Inverse Functions 


You may recall from calculus that if a function f is injective and surjective, 
then it has an inverse function f~! that “undoes” the effect of f in the 
sense that f~1(f(«)) =x for every x in the domain. (For example, if f(x) = x°, 
then f~1(x) = %/x.) We now review these ideas. Our approach uses two 
ingredients, outlined in the following definitions. 


Definition 12.6 Given a set A, the identity function on A is the func- 
tion i4:A—A defined as i,(x)=«x for every xE A. 


Example: If A = {1,2,3}, then i, = {(1, 1), (2,2),(3,3)}. Also iz = {(n,n):ne Z}. 
The identity function on a set is the function that sends any element of 
the set to itself. 

Notice that for any set A, the identity function i, is bijective: It is 
injective because i,4(x)=i,(y) immediately reduces to x= y. It is surjective 
because if we take any element 0b in the codomain A, then 6 is also in the 
domain A, and i,(b)=5. 


Definition 12.7 Given a relation R from A to B, the inverse relation 
of R is the relation from B to A defined as R~' = {(y,x):(x,y)€R}. In other 
words, the inverse of R is the relation R~! obtained by interchanging the 
elements in every ordered pair in R. 

For example, let A = {a,b,c} and B = {1,2,3}, and suppose f is the 
relation f = {(a,2),(b,3),(c,1)} from A to B. Then f-! = {(2,a),(3,5),(1,0)} 
and this is a relation from B to A. Notice that f is actually a function 
from A to B, and f~! is a function from B to A. These two relations are 
drawn below. Notice the drawing for relation f~! is just the drawing for f 
with arrows reversed. 


A B A B 
Z Z 
f = {(a,2),(6,3),(c, I} f-+ = {(2,a),(3,0),(1,c)} 
For another example, let A and B be the same sets as above, but consider 


the relation g = {(a,2),(b,3),(c,3)} from A to B. Then g™! = {(2,a),(3,b),(3,c)} 
is a relation from B to A. These two relations are sketched below. 
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A B A B 
= N 


g = {(a,2),(b,3),(c,3)} g ! ={(2,a),(3,b),(3,c)} 


This time, even though the relation g is a function, its inverse g™! is 
not a function because the element 3 occurs twice as a first coordinate of 
an ordered pair in g™!. 

In the above examples, relations f and g are both functions, and f~! is 
a function and g™! is not. This raises a question: What properties does f 
have and g lack that makes f~! a function and g™! not a function? The 
answer is not hard to see. Function g is not injective because g(b) = g(c) = 8, 
and thus (0,3) and (c,3) are both in g. This causes a problem with g™! 
because it means (3,b) and (3,c) are both in g™!, so g™! can’t be a function. 
Thus, in order for g~! to be a function, it would be necessary that g be 
injective. 

But that is not enough. Function g also fails to be surjective because 
no element of A is sent to the element 1¢B. This means g™! contains no 
ordered pair whose first coordinate is 1, so it can’t be a function from B to 
A. If g~ were to be a function it would be necessary that g be surjective. 

The previous two paragraphs suggest that if g is a function, then it 
must be bijective in order for its inverse relation g! to be a function. 
Indeed, this is easy to verify. Conversely, if a function is bijective, then its 
inverse relation is easily seen to be a function. We summarize this in the 
following theorem. 


Theorem 12.3 Let f:A—B bea function. Then f is bijective if and only 
if the inverse relation f~! is a function from B to A. 


Suppose f : A — B is bijective, so according to the theorem f7! is a 
function. Observe that the relation f contains all the pairs (x, f(x)) for xe A, 
so f~! contains all the pairs (f(x),x). But (f(«),x)¢ f~! means f~(f(x)) = x. 
Therefore f~!of (x) =x for every x¢ A. From this we get f~!of =i,. Similar 
reasoning produces fof~!=ig. This leads to the following definitions. 


Definition 12.8 If f:A-—B is bijective then its inverse is the function 
f-!:B—A. Functions f and f~! obey the equations f~tof =i, and 
fof SiR 
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You probably recall from algebra and calculus at least one technique 
for computing the inverse of a bijective function f: to find f~1, start with 
the equation y = f(x). Then interchange variables to get x = f(y). Solving 
this equation for y (if possible) produces y = f~'(x). The next two examples 
illustrate this. 


Example 12.12 The function f :R—R defined as f(x) =x°+1 is bijective. 
Find its inverse. 


We begin by writing y = x?+1. Now interchange variables to obtain 
x= y°+1. Solving for y produces y= Vx—1. Thus 


fla) = Wx—1. 
(You can check your answer by computing 
fF) = Yf@)-1= Vx84+1-1=x. 


Therefore f~(f(x)) =x. Any answer other than x indicates a mistake.) 


We close with one final example. Example 12.5 showed that the function 
g:ZxZ—Z-xZ defined by the formula g(m,n)= (m+n,m+2n) is bijective. 
Let’s find its inverse. The approach outlined above should work, but we 
need to be careful to keep track of coordinates in Zx Z. We begin by 
writing (x,y) = g(m,n), then interchanging the variables (x,y) and (m,n) to 
get (m,n) = g(x,y). This gives 


(m,n)=(x+y,x+2y), 
from which we get the following system of equations: 


x + y=m 
x + Q2y = n. 


Solving this system using techniques from algebra with which you are 
familiar, we get 
x = 2m-n 
y = n-m. 


Then (x,y) =(2m—n,n—m), so|g~(m,n) =(2m—n,n-m). 
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We can check our work by confirming that g~!(g(m,n)) = (m,n). Doing 
the math, 


ge (gimn)) = g \m+n,m+2n) 
= (2(m+n)—(m+2n),(m+2n)-(m+n)) 


(m,n). 


Exercises for Section 12.5 


1. Check that the function f : Z — Z defined by f(n) = 6—n is bijective. Then 
compute f~!. 

2. In Exercise 9 of Section 12.2 you proved that f : R- {2} — R- {5} defined by 

1 
f (x)= sd ~ is bijective. Now find its inverse. 
pa 

3. Let B= {2":neZ}={...,4,5,1,2,4,8,...}. Show that the function f:Z—B 
defined as f(n)= 2” is bijective. Then find f-!. 

4, The function f :R— (0,00) defined as f(x) = er tl ig bijective. Find its inverse. 


5. The function f:R—R defined as f(x)=2x-e is bijective. Find its inverse. 

6. The function f :Zx Z— Z~x Z defined by the formula f(m,n)= (5m +4n,4m + 3n) 
is bijective. Find its inverse. 

7. Show that the function f :R? — R? defined by the formula f(x, y) = ((x? + Dy,x°) 
is bijective. Then find its inverse. 

8. Is the function 6: A(Z) — A(Z) defined as 6(X) =X bijective? If so, what is its 
inverse? 

9. Consider the function f :RxN—N-xR defined as f(x,y) =(y,3xy). Check that 
this is bijective; find its inverse. 


10. Consider f :N— Z defined as f(n)= cD or =F 7 This function is bijective 


by Exercise 18 in Section 12.2. Find its inverse. 


12.6 Image and Preimage 


It is time to take up a matter of notation that you will encounter in future 
mathematics classes. Suppose we have a function f:A—B. If X cA, the 
expression f(X) has a special meaning. It stands for the set {f(x):x¢ X}. 
Similarly, if Y ¢B then f~1(Y) has a meaning even if f is not invertible. 
The expression f~'(Y) stands for the set {xe A: f(x)¢Y}. Here are the 
precise definitions. 
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Definition 12.9 Suppose f:A—B is a function. 
1. If X CA, the image of X is the set f(X)= {f(x):xeX}CB. 
2. If Y CB, the preimage of Y is the set f-'(Y)={xeA:f(x)EY}CA. 


In words, the image f(X) of X is the set of all things in B that f sends 
elements of X to. (Roughly speaking, you might think of f(X) as a kind of 
distorted “copy” or “image” of X in B.) The preimage f~1(Y) of Y is the set 
of all things in A that f sends into Y. 

Maybe you have already encountered these ideas in linear algebra, in 
a setting involving a linear transformation T :V — W between two vector 
spaces. If X <V is a subspace of V, then its image 7(X) is a subspace of W. 
If Y CW is a subspace of W, then its preimage T~1(Y) is a subspace of V. 
(If this does not sound familiar, then ignore it.) 


Example 12.13 Let /: {s,t,u,v,w,x,y,z} — {0,1,2,3,4,5,6,7,8,9}, where 
f = {(s, 4), (t,8), (u, 8), (v, 1), (w, 2), (x, 4), (y, 6), (2, 4)}- 


Notice that f is neither injective nor surjective, so it certainly is not 
invertible. Be sure you understand the following statements. 


1. f({s,t,u,z}) = {8,4} 
f ({s,x,z}) = {4} 

F(ts, v,w,y}) = {1,2,4,6} 
f* ({4}) = {s,x,2} 
. f-*({4,9}) ={s,x,2} 
f-M({9) =@ 

f. 


( 
1({1,4,8}) = {s,t,u,v,x,z} 


STO) OF op So 


It is important to realize that the X and Y in Definition 12.9 are 
subsets (not elements!) of A and B. Thus, in the above example we had 
~1({4}) = {s,x,z}, though f~1(4) has absolutely no meaning because the 
inverse function f~! does not exist. Likewise, there is a subtle difference 
between f({a}) = {4} and f(a)=4. Be careful. 


Example 12.14 Consider the function f :R— R defined as f(x) = x?. 
Note that f({0,1,2}) = {0,1,4} and f~1({0,1,4}) = {-2,-1,0,1,2}. This shows 
f-\(f(X)) 4X in general. 

Now check your understanding of the following statements involving 
images and preimages of intervals. f([—2,3]) = [0,9], and f~1((0,9]) = [—3,3]. 
Also f(R) =[0,co) and f~({-2,-1) = @ 
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If you continue your mathematical studies, you are likely to encounter 


the following result in the future. For now, you are asked to prove it in 
the exercises. 


Theorem 12.4 Suppose f:A— Bisa function. LetW,X CA, and Y,ZCB. 
Then: 


1. fWax)cfW)nf(X) 

. fWUX)= f(W)U F(X) 

_ fn Z)= Ff Wn") 
_f WZ uZ)=f Yup") 
_Xof“(f(®) 

-Ycf(f iY) 


HD oF C LD 


Exercises for Section 12.6 


1. 


2. 


Consider the function f :R—R defined as f(x) =x?+3. Find f([-3,5]) and 
f-*(112, 19D). 
Consider the function f : {1,2,3,4,5,6,7} — {0,1,2,3,4,5,6,7,8,9} given as 


f = {(1,3), (2,8), (3,3), (4, D, (5, 2), (6, 4),(7, 6). 


Find: f({1,2,3}), f({4,5,6,7}), £(@), f-1({0,5,9}) and f~1({0,3,5,9}). 


. This problem concerns functions f : {1,2,3,4,5,6,7} — {0,1,2,3,4}. How many 


such functions have the property that |f~1({3})| = 3? 


. This problem concerns functions f : {1,2,3,4,5,6,7,8} — {0,1,2,3,4,5,6}. How 


many such functions have the property that |f~1({2})| = 4? 


. Consider a function f: A —B and a subset X CA. We observed in Section 12.6 


that f-(f(X)) 4X in general. However X ¢ f~!(f(X)) is always true. Prove this. 


. Given a function f : A —B and a subset Y CB, is f(f-(Y))=Y always true? 


Prove or give a counterexample. 


. Given a function f:A—B and subsets W,X cA, prove f(WnNX)S f(W)nf(X). 
. Given a function f:A—B and subsets W,X cA, then f(WnX)= f(W) nf (X) is 


false in general. Produce a counterexample. 


. Given a function f:A— 8B and subsets W,X CA, prove f(WUX)= f(W)Uf(X). 
. Given f:A—B and subsets Y,Z CB, prove f-(Y nZ)=f-'(Y)nf-(Z). 

. Given f:A—B and subsets Y,Z <B, prove f-'(Y UZ) =f-"(Y)uf-(Z). 

. Consider a function f:A— B. Prove that f is invertible if and only if X = 


f-1(f(X)) for every X CA. 


. Let f:A—B be a function, and X <A. Prove or disprove: f(f- 1) = f (X). 
. Letf: A —B be a function, and Y CB. Prove or disprove: f~!(f(f-1(Y))) = f-1(Y). 


CHAPTER 13 


Cardinality of Sets 


his chapter is all about cardinality of sets. At first this looks like a 
very simple concept. To find the cardinality of a set, just count its 
elements. If A = {a,b,c,d}, then |A|=4; if B= {ne Z:-5<n<5}, then 
|B| =11. In this case |A|<|B|. What could be simpler than that? 
Actually, the idea of cardinality becomes quite subtle when the sets 
are infinite. The main point of this chapter is to explain how there are 
numerous different kinds of infinity, and some infinities are bigger than 
others. Two sets A and B can both have infinite cardinality, yet |A| <|B|. 


13.1 Sets with Equal Cardinalities 


We begin with a discussion of what it means for two sets to have the 
same cardinality. Up until this point we’ve said |A|=|B| if A and B have 
the same number of elements: Count the elements of A, then count the 
elements of B. If you get the same number, then |A| = |B|. 

Although this is a fine strategy if the sets are finite (and not too big!), 
it doesn’t apply to infinite sets because we’d never be done counting their 
elements. We need a new approach that applies to both finite and infinite 
sets. Here it is: 


Definition 13.1 Two sets A and B have the same cardinality, written 
|A| =|B|, if there exists a bijective function f:A— B. If no such bijective 
function exists, then A and B have unequal cardinalities, that is |A| 4 |B|. 


The above picture illustrates our definition. There is a bijective function 
f:A—B, so |A|=|B|. The function f matches up A with B. Think of f as 
describing how to overlay A onto B so that they fit together perfectly. 
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On the other hand, if A and B are as indicated in either of the following 
figures, then there can be no bijection f:A— B. (The best we can do is a 
function that is either injective or surjective, but not both). Therefore the 
definition says |A|#|B| in these cases. 


Example 13.1 The sets A={neZ:0<n<5} and B={neZ:-5<n<0} 
have the same cardinality because there is a bijective function f:A—B 
given by the rule f(n)=-—n. 


Several comments are in order. First, if |A|=|B|, there can be lots of 
bijective functions from A to B. We only need to find one of them in order to 
conclude |A|=|B|. Second, as bijective functions play such a big role here, 
we use the word bijection to mean bijective function. Thus the function 
f(n)=-—n from Example 13.1 is a bijection. Also, an injective function is 
called an injection and a surjective function is called a surjection. 

We emphasize and reiterate that Definition 13.1 applies to finite as 
well as infinite sets. If A and B are infinite, then |A|=|B| provided there 
exists a bijection f:A— B. If no such bijection exists, then |A| 4 |B|. 


Example 13.2 This example shows that |N| =|Z|. To see why this is true, 
notice that the following table describes a bijection f :N— Z. 


n ji 2 3 4 5 6 7 8 9 10 11 12 18 14 15 


Gd iO ts Go Ser 8 BB ed 5 -5 6 -6 7 -T 


Notice that f is described in such a way that it is both injective and 
surjective. Every integer appears exactly once on the infinitely long second 
row. Thus, according to the table, given any b€ Z there is some natural 
number n with f(n)=b, so f is surjective. It is injective because the way 
the table is constructed forces f(m) 4 f(n) whenever m#n. Because of this 
bijection f :N— Z, we must conclude from Definition 13.1 that |N| =|Z|. 


Example 13.2 may seem slightly unsettling. On one hand it makes 
sense that |N| =|Z| because N and Z are both infinite, so their cardinalities 
are both “infinity.” On the other hand, Z seems twice as large as N because 
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Z has all the negative integers as well as the positive ones. Definition 13.1 
settles the issue by producing |N|=|Z|. We summarize this with a theorem. 


Theorem 13.1 There exists a bijection f:N— Z. Therefore |N| = |Z|. 


The fact that N and Z have the same cardinality might prompt us 
compare the cardinalities of other infinite sets. How, for example, do N 
and R compare? Let’s turn our attention to this. 

In fact, |N| 4|R|. This was first recognized by Georg Cantor (1845-1918), 
who devised an ingenious argument to show that there are no surjective 
functions f:N—R. This in turn implies that there can be no bijections 
f :N—R, so |N| 4|R| by Definition 13.1. 

We now describe Cantor’s argument for why there are no surjections 
f:N—R. We will reason informally, rather than writing out an exact proof. 
Take any arbitrary function f :N—R. Here’s why f can’t be surjective: 


Imagine making a table for f, where values of n in N are in the left- 
hand column and the corresponding values f(n) are on the right. The 
first few entries might look something as follows. In this table, the real 
numbers f(n) are written with all their decimal places trailing off to the 
right. Thus, even though f(1) happens to be the real number 0.4, we write 
it as 0.40000000...., etc. 


f(n) 


-40000000000000... 
.596060708666900... 
.90500940044101... 
.590704008048050... 
-9DD0Z6E000000506... 
.82809582050020... 
.890505550655808... 
.72080640000448... 
.59500008888007 7... 
.90020722078051... 
.90000880000900... 
.90280008009671... 
.89008024008050... 
.90008742080226.., 


a 
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There is a diagonal shaded band in the table. For each neéN, this band 
covers the n“” decimal place of f(n): 


The 1st decimal place of f(1) is the 1st entry on the diagonal. 

The 2nd decimal place of f(2) is the 2nd entry on the diagonal. 
The 3rd decimal place of f(3) is the 3rd entry on the diagonal. 
The 4th decimal place of f(4) is the 4th entry on the diagonal, etc. 


The diagonal helps us construct a number b € R that is unequal to any f(n). 
Just let the nth decimal place of b differ from the nth entry of the diagonal. 
Thus the nth decimal place of b differs from the nth decimal place of f(n). 
In order to be definite, define b to be the number between 0 and 1 whose 
nth decimal place is 6 if the nth decimal place of f(n) is not 6, and whose 
nth decimal place is 2 if the nth decimal place of f(n) equals 6. Thus, for 
the function f illustrated in the above table, we have 


b = 0.62662666666262... 


and b has been defined so that, for any né€N, its nth decimal place is 
unequal to the nth decimal place of f(n). Therefore f(n) 4 b for every 
natural number n, meaning f is not surjective. 

Since this argument applies to any function f :N— R (not just the one 
in the above example) we conclude that there exist no bijections f :N—R, 
so |N| 4|R| by Definition 13.1. We summarize this as a theorem. 


Theorem 13.2 There exists no bijection f:N—R. Therefore |N| 4 |R|. 


Exercises for Section 13.1 
Show that the two given sets have equal cardinality by describing a bijection from 
one to the other. Describe your bijection with a formula (i.e. not as a table). 


1. R and (0,00) 9. {0,1}N and N 
2. R and (v2,00) 10. {0,1}xN and Z 
3. R and (0,1) fi tena 
. [0,1] and (0, 

4. The set of even integers and 

the set of odd integers 12. N and Z (Suggestion: use Exercise 18 
5. A={3k:keZ} and B={7k:keZ} of Section 12.2.) 
6. Nand S={¥2 :neN} 13, AN) and AZ) 
7 PF and:s= {...,4,4,4,1,2,4,8,16,...} (Suggestion: use Exercise 12, above.) 
8. Z and S={xeR:sinx=1} 14. NxN and {(n,m)eENxN:n<m} 


15. Find a formula for the bijection f in Example 13.2 (page 216). 
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13.2 Countable and Uncountable Sets 

Let’s summarize the main points from the previous section. 
1. |A| =|B| if and only if there exists a bijection f:A— B. 

2. |N|=|Z| because there exists a bijection f :N— Z. 

3. |N| #4 |R| because there exists no bijection f:N—R. 


Thus, even though N, Z and R are all infinite sets, their cardinalities 
are not all the same. The sets N and Z have the same cardinality, but R’s 
cardinality is different from that of both the other sets. This is our first 
indication of how infinite sets can have different sizes, and we will now 
make some definitions to put words and symbols to this phenomenon. 

In a certain sense you can count the elements of N; you can count its 
elements off as 1,2,3,4,..., but you’d have to continue this process forever 
to count the whole set. Thus we will call N a countably infinite set, and 
the same term is used for any set whose cardinality equals that of N. 


Definition 13.2 Suppose A is a set. Then A is countably infinite 
if |N| = |A|, that is, if there exists a bijection f:N— A. The set A is 
uncountable if A is infinite and |N| Z|A|, that is, if A is infinite and there 
exists no bijection f:N— A. 


Thus Z is countably infinite but R is uncountable. This section deals 
mainly with countably infinite sets. Uncountable sets are treated later. 

If A is countably infinite, then |N| =|Al|, so there is a bijection f:N— A. 
You can think of f as “counting” the elements of A. The first element of A 
is f(1), followed by f(2), then f(3) and so on. It makes sense to think of a 
countably infinite set as the smallest type of infinite set, because if the 
counting process stopped, the set would be finite, not infinite; a countably 
infinite set has the fewest number of elements that a set can have and 
still be infinite. It is common to reserve the special symbol Xo to stand for 
the cardinality of countably infinite sets. 


Definition 13.3 The cardinality of the natural numbers is denoted as Xo. 
That is, IN| =o. Thus any countably infinite set has cardinality No. 


(The symbol X is the first letter in the Hebrew alphabet, and is pronounced 
“aleph.” The symbol Xo is pronounced “aleph naught.”) The summary of 
facts at the beginning of this section shows |Z| = No and |R| 4 Xo. 


Example 13.3 Let E = {2k :k € Z} be the set of even integers. The function 
f:Z—E defined as f(n) = 2n is easily seen to be a bijection, so we have 
|Z|=|E|. Thus, as |N| =|Z|=|E|, the set FE is countably infinite and |E|=No. 
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Here is a significant fact: The elements of any countably infinite set A 
can be written in an infinitely long list a1,a2,a3,a4,... that begins with some 
element a;¢A and includes every element of A. For example, the set E in 
the above example can be written in list form as 0,2, —-2,4,—4,6,—-6,8,-8,... 
The reason that this can be done is as follows. Since A is countably infinite, 
Definition 13.2 says there is a bijection f:N— A. This allows us to list 
out the set A as an infinite list f(1), f(2), f(3), f(4),... Conversely, if the 
elements of A can be written in list form as aj,a9,q3,..., then the function 
f:N—A defined as f(n)=a, is a bijection, so A is countably infinite. We 
summarize this as follows. 


Theorem 13.3 A set A is countably infinite if and only if its elements 
can be arranged in an infinite list a1,a2,a3,a4,... 


As an example of how this theorem might be used, let P denote the set 
of all prime numbers. Since we can list its elements as 2,3,5,7,11,138,..., it 
follows that the set P is countably infinite. 

As another consequence of Theorem 13.3, note that we can interpret the 
fact that the set R is not countably infinite as meaning that it is impossible 
to write out all the elements of R in an infinite list. (After all, we tried to 
do that in the table on page 217, and failed!) 

This raises a question. Is it also impossible to write out all the elements 
of Q in an infinite list? In other words, is the set Q of rational numbers 
countably infinite or uncountable? If you start plotting the rational num- 
bers on the number line, they seem to mostly fill up R. Sure, some numbers 
such as V2, z and e will not be plotted, but the dots representing rational 
numbers seem to predominate. We might thus expect Q to be uncountable. 
However, it is a surprising fact that Q is countable. The proof presented 
below arranges all the rational numbers in an infinitely long list. 


Theorem 13.4 The set Q of rational numbers is countable. 


Proof. To prove this, we just need to show how to write the set Q in list 
form. Begin by arranging all rational numbers in an infinite array. This is 
done by making the following chart. The top row has a list of all integers, 
beginning with 0, then alternating signs as they increase. Each column 
headed by an integer k contains all the fractions (in reduced form) with 
numerator k. For example, the column headed by 2 contains the fractions 
2. 2 2, Pies It does not contain z 2, 2, etc., because those are not reduced, 
and in fact their reduced forms appear in the column headed by 1. You 
should examine this table and convince yourself that it contains all rational 
numbers in Q. 
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Ot SB 28 Oe a0 ee Sea ea OR 

0 1 -1 2 -2 3 -3 4 -4 5 -5 

1 1 1 1 1 1 1 1 1 1 1 
1 -1 2 -2 5) -3 4 -4 5 -5 
2 2 3 3 2 2 3 3 2 2 
1 -1 2 -2 3 -3 4 -4 5 -5 
3 3 5 5 4 4 5 5 3 3 
1 o-1 2 -2 8 -8 4 =4 5B =5 
4 4 7 7 5 5 7 7 4 4 
1 -1 2 -2 3 -3 4 -4 5 -5 
5 5 9 9 7 7 9 9 6 6 
iL -1 2 -2 3 -3 4 —4 5 -5 
6 6 11 11 8 8 11 11 7 7 
1-1 2 -2 38 -8 4 =-4 5 =5 
4 13 13 8 
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Next, draw an infinite path in this array, beginning at ¢ and snaking 
back and forth as indicated below. Every rational number is on this path. 


0 


1 


-1 


2 


-2 


3 


—3 


4 


—4 


5 


—5 
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Beginning at ° and following the path, we get an infinite list of all 
rational numbers: 


01 1 1 i 22 1131 3142 2 2 2 2 2.3 3 
? > 9? 2” > "3? 5” 3’ 3° 4’ 4’ Te 7B 5” 3’ 3’ > > 9? 
Therefore, by Theorem 13.3, it follows that Q is countably infinite. a 


It is also true that the Cartesian product of two countably infinite sets 
is itself countably infinite, as our next theorem states. 


Theorem 13.5 If A and B are both countably infinite, then so is A x B. 


Proof. Suppose A and B are both countably infinite. By Theorem 13.3, we 
know we can write A and B in list form as 


A = {a1,a2,a3,@4,...}, 
B {b1,b9,b3, ba,...}. 


Figure 13.1 shows how to form an infinite path winding through all of AxB. 
Therefore A x B can be written in list form, so it is countably infinite. 


B 


aise jae (i5.80 eo ea eae eo 
(a1, 66) (a2, b6) (ag, b6) (a4, b6) (a5, 6) (26,06) (a7, b6) -°° 
(a1, 65) (a2,65) (a3,65) (a4,b5) (a5,65) (6,65) (a7,65) °° 
(a1, 64) (a2,64) (a3, 64) (a4,64) (5,64) (26,64) (a7, 64) - °° 
(a1, 63) (a2,63) (a3,63) (a4,b3) (a5,63) (6,63) (a7,63) *° 
(a1, 62) (a2, b2) (a3,62) (a4, 62) (a5, 62) (a6, 62) (a7, 62) °° 


(a1,61) (a2,61) (43,61) (44,61) (45,64) (ag,61) (47,61) °° 


Figure 13.1. The product of countably infinite sets is countably infinite 


Countable and Uncountable Sets 223 


As an example of a consequence of this theorem, notice that since Q is 
countably infinite, the set Q x Q is also countably infinite. 

Recall that the word “corollary” means a result that follows easily from 
some other result. We have the following corollary of Theorem 13.5. 


Corollary 13.1 Given n countably infinite sets A 1,A2,A3,...,An, with 
n= 2, the Cartesian product A, x Ag x A3x-:-xA, is also countably infinite. 


Proof. The proof is by induction. For the basis step, notice that when 
n=2 the statement asserts that for countably infinite sets A; and Ag, the 
product A, x Ag is countably infinite, and this is true by Theorem 13.5. 

Assume that for k =>2, any product A, x Ag x A3x-:- xA,z of countably 
infinite sets is countably infinite. Consider a product A; x Ag x A3x---xApy1 
of k+1 countably infinite sets. It is easily confirmed that the function 


f:A,xAgxAgx-::xApxApey — (A, xAgxAgx-:+x Ap) x Apey 


Fie eee) S(O xo gap); eee) 


is bijective, so |A, x Ag x A3 Xeee x Ap xApiil =|(A, x Ag x Ag Xeee x Ap) x Apel. 
By the induction hypothesis, (A; x Ag x A3 x-:-x Az) x Agi, is a product of 
two countably infinite sets, so it is countably infinite by Theorem 13.5. As 
noted above, A; x Ag x Ag x-+-x Az x Agi, has the same cardinality, so it too 
is countably infinite. a 


Theorem 13.6 If A and B are both countably infinite, then AUB is 
countably infinite. 


Proof. Suppose A and B are both countably infinite. By Theorem 13.3, we 
know we can write A and B in list form as 


A = {a1,a2,a3,a4,...}, 
B {b1,b9,b3, ba,...}. 


We can “shuffle” A and B into one infinite list for AUB as follows. 
AUB= {a1,b1,a2,b2,03,b3,a4, b4,...}. 


(We agree not to list an element twice if it belongs to both A and B.) 
Therefore, by Theorem 13.3, it follows that AUB is countably infinite. 
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Exercises for Section 13.2 


. Prove that the set A = {In(n):n€N} CR is countably infinite. 
. Prove that the set A = {(m,n)€NxN: m<n} is countably infinite. 
. Prove that the set A = {(5n,-3n):ne Z} is countably infinite. 


. Prove that the set of all irrational numbers is uncountable. (Suggestion: 
Consider proof by contradiction using Theorems 13.4 and 13.6.) 


Pe CFO ND = 


OU 


. Prove or disprove: There exists a countably infinite subset of the set of irrational 
numbers. 


. Prove or disprove: There exists a bijective function f:Q—R. 
. Prove or disprove: The set Q!°° is countably infinite. 
. Prove or disprove: The set Z x Q is countably infinite. 


Soma 


. Prove or disprove: The set {0,1} x N is countably infinite. 

10. Prove or disprove: The set A = qv :n €N} countably infinite. 

11. Describe a partition of N that divides N into eight countably infinite subsets. 
12. Describe a partition of N that divides N into Xo countably infinite subsets. 
13. Prove or disprove: If A={X CN : X is finite}, then |A|=No. 

14. Suppose A = {(m,n)ENxR:n=2m}. Is it true that |N| =|Al? 


13.3 Comparing Cardinalities 


At this point we know that there are at least two different kinds of infinity. 
On one hand, there are countably infinite sets such as N, of cardinality Xo. 
Then there is the uncountable set R. Are there other kinds of infinity 
beyond these two kinds? The answer is “yes,” but to see why we first need 
to introduce some new definitions and theorems. 

Our first task will be to formulate a definition for what we mean by 
|A| <|B|. Of course if A and B are finite we know exactly what this means: 
|A| <|B| means that when the elements of A and B are counted, A is found 
to have fewer elements than B. But this process breaks down if A or B is 
infinite, for then the elements can’t be counted. 

The language of functions helps us overcome this difficulty. Notice 
that for finite sets A and B it is intuitively clear that |A|<|B| if and only 
if there exists an injective function f : A — B but there are no surjective 
functions f:A —B. The following diagram illustrates this: 
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We will use this idea to define what is meant by |A|<|B|. For emphasis, 
the following definition also restates what is meant by |A|= |BI. 


Definition 13.4 Suppose A and B are sets. 
(1) |A| =|B| means there is a bijection f:A —B. 
(2) |A| < |B| means there is an injection f : A — B, but no surjection f: A — B. 


For example, consider N and R. The function f :N— R defined as f(n)=n 
is clearly injective, but it is not surjective because given the element 5 ER, 
we have f(n) # 5 for every néEN. In fact, Theorem 13.2 of Section 13.1 
asserts that there is no surjection N—R. Definition 13.4 yields 


IN| < IRI. (13.1) 


Said differently, Xo < |RI. 
Is there a set X for which |R| < |X|? The answer is “yes,” and the next 
theorem explains why. Recall that Y(A) denotes the power set of A. 


Theorem 13.7 If A is any set, then |A|<|A(A)|. 


Proof. Before beginning the proof, we remark that this statement is obvious 
if A is finite, for then |A| <2!4! =|A(A)|. But our proof must apply to all 
sets A, both finite and infinite, so it must use Definition 13.4. 

We prove the theorem with direct proof. Let A be an arbitrary set. 
According to Definition 13.4, to prove |A| <|A(A)| we must show that there 
is an injection f:A — A(A), but no surjection f:A— AA). 

To see that there is an injection f: A — A(A), define f by the rule 
f(x) = {x}. In words, f sends any element x of A to the one-element set 
{x$e P(A). Then f:A — AA) is injective, as follows. Suppose f(x) = f(y). 
Then {x} ={y}. Now, the only way that {x} and {y} can be equal is if x= y, 
so it follows that x= y. Thus f is injective. 

Next we need to show that there exists no surjection f:A— A(A). 
Suppose for the sake of contradiction that there does exist a surjection 
f:A— AA). Notice that for any element xe A, we have f(x)€ A(A), so 
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f(x) is a subset of A. Thus f is a function that sends elements of A to 
subsets of A. It follows that for any x¢ A, either x is an element of the 
subset f(x) or it is not. Using this idea, define the following subset B of A: 


B= {xe A:x€ f(x)CA. 


Now since BCA we have Be “(A), and since f is surjective there is an 
a€éA for which f(a)=B. Now, either ace B or a¢ B. We will consider these 
two cases separately, and show that each leads to a contradiction. 


Case 1. If a¢B, then the definition of B implies a ¢ f(a), and since f(a)=B 
we have a¢B, which is a contradiction. 

Case 2. If a¢B, then the definition of B implies aé f(a), and since f(a)=B 
we have a€B, again a contradiction. 


Since the assumption that there is a surjection f:A— A(A) leads toa 
contradiction, we conclude that there are no such surjective functions. 

In conclusion, we have seen that there exists an injection A — YA(A) but 
no surjection A — Y(A), so Definition 13.4 implies that |A| <|A(A)]. | 


Beginning with the set A=N and applying Theorem 13.7 over and over 
again, we get the following chain of infinite cardinalities. 


Xo = IN| < IAIN) < LACAN) < |ACA(AWN))) < (13.2) 


Thus there is an infinite sequence of different types of infinity, starting 
with Xo and becoming ever larger. The set N is countable, and all the sets 
YAN), A(AWN)), etc., are uncountable. 

In the next section we will prove that |A(N)| =|R|. Thus |N| and |R| 
are the first two entries in the chain (13.2) above. They are are just two 
relatively tame infinities in a long list of other wild and exotic infinities. 

Unless you plan on studying advanced set theory or the foundations 
of mathematics, you are unlikely to ever encounter any types of infinity 
beyond No and |R|. Still you will in future mathematics courses need to 
distinguish between countably infinite and uncountable sets, so we close 
with two final theorems that can help you do this. 


Theorem 13.8 An infinite subset of a countably infinite set is countably 
infinite. 


Proof. Suppose A is an infinite subset of the countably infinite set B. 
Because B is countably infinite, its elements can be written in a list 
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b1,b2,63,b64,... Then we can also write A’s elements in list form by proceed- 
ing through the elements of B, in order, and selecting those that belong to 
A. Thus A can be written in list form, and since A is infinite, its list will 
be infinite. Consequently A is countably infinite. a 


Theorem 13.9 If UcCA, and U is uncountable, then A is uncountable. 


Proof. Suppose for the sake of contradiction that Uc A, and U is uncount- 
able but A is not uncountable. Then since UC A and U is infinite, then A 
must be infinite too. Since A is infinite, and not uncountable, it must be 
countably infinite. Then U is an infinite subset of a countably infinite set 
A, so U is countably infinite by Theorem 13.8. Thus U is both uncountable 
and countably infinite, a contradiction. a 


Theorems 13.8 and 13.9 can be useful when we need to decide whether 
a set is countably infinite or uncountable. They sometimes allow us to 
decide its cardinality by comparing it to a set whose cardinality is known. 

For example, suppose we want to decide whether or not the set A = R? 
is uncountable. Since the uncountable set U =R can be regarded as the 
x-axis of the plane R? (and thus a subset of R?), Theorem 13.9 implies that 
R? is uncountable. Other examples can be found in the exercises. 


Exercises for Section 13.3 


1. Suppose B is an uncountable set and A is a set. Given that there is a surjective 
function f : A — B, what can be said about the cardinality of A? 

2. Prove that the set C of complex numbers is uncountable. 

3. Prove or disprove: If A is uncountable, then |A| = |RI. 

4, Prove or disprove: If AS BCC and A and C are countably infinite, then B is 
countably infinite. 

5. Prove or disprove: The set {0,1} xR is uncountable. 

6. Prove or disprove: Every infinite set is a subset of a countably infinite set. 

7. Prove or disprove: If ACB and A is countably infinite and B is uncountable, 
then B-A is uncountable. 

8. Prove or disprove: If there is a injection f: A —B and a surjection g:A—B, 
then there is a bijection h: A — B. 

9. Prove that if A and B are finite sets with |A|=|B|, then any injection f:A—B 
is also a surjection. Show this is not necessarily true if A and B are not finite. 

10. Prove that if A and B are finite sets with |A|=|B|, then any surjection f:A—B 

is also an injection. Show this is not necessarily true if A and B are not finite. 
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13.4 The Continuum Hypothesis 


Let us now turn our attention to the cardinality of the set of real numbers. 
We established earlier in this chapter that Xo < |R|. For nearly a century 
after Cantor formulated his theories on infinite sets, mathematicians 
struggled with the question of whether or not there exists a set A for which 


No < |A| < |RI. 


It was commonly suspected that no such set exists, but no one was able 
to prove or refute this. The assertion that no such A exists came to be 
called the continuum hypothesis. The quest to prove (or disprove) the 
continuum hypothesis consumed many brilliant mathematicians until, in 
1964, Paul Cohen discovered a startling and unsettling truth about it. We 
will in this section discuss Cohen’s strange—almost paradoxical—discovery 
about the continuum hypothesis. 

But first we develop a few results concerning the cardinalities of in- 
tervals on the real line. This final section adopts a more informal tone, 
so that details do not obscure the bigger picture. (But make no mistake: 
Cohen’s proof is beyond our current scope. Though we will discuss his 
conclusion, the following results merely set the stage for this discussion.) 

Recall that a lemma is a proposition that is used as an aid in proving 
something else. The next two lemmas will help prove that |R| =| A(N)|. 


Lemma 13.1 The intervals [0,1] and (0,1) have the same cardinality. 
Proof. See the solution of Exercise 11 of Section 13.1. a 
Lemma 13.2 The sets [0,1] and R have the same cardinality. 


Proof. Notice that the open interval (0,1) has the same cardinality as R by 
virtue of the bijection fh: (0,1) — R, where h(x) = cot(zx). Thus |(0,1)| = |RI. 
But |[0,1]| = |(0,1)| by the previous lemma, so |[0,1]| = |RI. | 


Theorem 13.10 The sets R and A(N) have the same cardinality. 


Proof. By the previous lemma, |R| = |[0,1]|. Thus it suffices to prove that 
\[0, 1]| = | A(N)|. Therefore, we need to produce a bijection f :[0,1] — A(N) 
sending real numbers in [0,1] to subsets of N. 

It will be advantageous express real numbers in base 2 rather than 10. 
Recall that any real number in [0,1] has a unique binary decimal repre- 
sentation 0.b;b2b3b4..., where each b; is either 0 or 1. Numerically, 
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0.bibob3b4... = Le 


where the infinite series is seen to converse to a number in [0,1] by 
comparison to the convergent series 1° , ma =1. For example, we have 


0.0000000000... = 


| 
Me 


Ms 

mle Sle No 
I 
= 


~ 
Il 
BH 


0.1111111111... = 


H 


0.1100000000... 


l| 
+ 
| 


Nl F 
| Oo 


and so on. Define a function f :[0,1] — A(N) as 
f (0.b1b2b3b4...) — {n EN: by, = 1}. 


For example, f(0.00000...)=¢@, and f(0.111111...)=N, and f(0.10000...)= {1}. 
Also f(0.1010101010...) = {1,3,5,7,9,...}, and (0.010101010...) = {2,4,6,8,...}. 
Note f is surjective: For any X €« A(N), let 0.b1b2b3b4...€ [0,1] be such 
that b, =1ifneX and b, =0 if n¢X. Then f£(0.b1b2b3b4...)= X. 
Likewise f is injective: Suppose f(0.b1b2b3b4...) = f(0.ajaga3a4...)= X 
Then b, =a, is 1 or 0, depending on whether or not ne X, for any neN. 
Therefore 0.b1b2b3b4...=0.aja2a3a4... 
As we now have a bijection f :[0,1]— A(N), the proof is complete. 


We have just proved ||R| = |A(N)|. Let’s place this in the context of the 
chain (13.2) on page 226. We now have the following relationships. 


IR| 
I 
IN; < IAN! < |ACAWN)! < |\ACACAN))) < 


From this, we can see that the continuum hypothesis asserts that no set 
has a cardinality between that of N and its power set. 

Although this may seem intuitively plausible, it eluded proof since 
Cantor posed it in the 1880s. In fact, the real state of affairs is almost 
paradoxical. In 1931, the logician Kurt Godel proved that there exist 
true statements that cannot be proven true. Later he showed that the 
negation of the continuum hypothesis is such a statement. Together, these 
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two results implied that either (a) the continuum hypothesis is false and 
cannot be proven false, or (b) the continuum hypothesis is true. 

Some 20 years later, the mathematician Paul Cohen proved that the 
continuum hypothesis cannot be proved. 

Taken together, Godel’s and Cohen’s results mean that no logical conflict 
can arise from either asserting or denying the continuum hypothesis. We 
are free to either accept it as true or accept it as false, and the two choices 
lead to different—but equally consistent—versions of set theory. 

On the face of it, this seems to undermine the very foundation of 
logic, and hence everything we have done in this book. The continuum 
hypothesis should be a statement, that is, it should be definitely true or 
definitely false. How could it be both, or neither? 

Here is an analogy that may help make sense of this. Consider the 
number systems Z,. What if we asked whether [2] = [0] is true or false? Of 
course the answer depends on n. The expression [2] =[0] is true in Z2 and 
false in Z3. Moreover, if we assert that [2] =[0] is true, we are logically 
forced to to the conclusion that this is taking place in the system Zo. If 
we assert that [2]=[0] is false, then we are dealing with some other Z,,. 
The fact that [2] =[0] can be either true or false does not necessarily mean 
that there is some inherent inconsistency within the individual number 
systems Z,. The equation [2]=[0] is a true statement in the “universe” of 
Zz and a false statement in the universe of (say) Z3. 

It is the same with the continuum hypothesis. Saying it’s true leads to 
one system of set theory. Saying it’s false leads to some other system of 
set theory. Gédel’s and Cohen’s results imply that these two types of set 
theory, although different, are equally consistent and valid mathematical 
universes. 

So what should you believe? Fortunately, it really does not make much 
difference; most important results in mathematics do not hinge on the 
continuum hypothesis. Unless you are interested in a deep study of the 
foundations of mathematics, you will be fine accepting the continuum 
hypothesis as true. Most mathematicians are agnostics on this issue, but if 
pressed they lean toward the version of set theory in which the continuum 
hypothesis holds. 

The situation with the continuum hypothesis is a testament to the 
immense complexity of mathematics. It is a reminder of the importance of 
rigor and the careful, systematic methods of reasoning introduced in this 
book. 


Conclusion 


f you have internalized the ideas in this book, then you have a set 
of rhetorical tools for deciphering and communicating mathematics. 
These tools are indispensable at any level. But of course it takes more 
than mere tools to build something. Planning, creativity, inspiration, skill, 
talent, intuition, passion and persistence are also vitally important. It 
is safe to say that if you have come this far, then you probably possess a 
sufficient measure of these traits. 

The quest to understand mathematics has no end, but you are well 
equipped for the journey. It is my hope that the things you have learned 
from this book will lead you to a higher plane of understanding, creativity 
and expression. 

Good luck and best wishes. 


R.H. 


Solutions 


Chapter 1 Exercises 
Section 1.1 


{5x—1:x¢€ Z}={...—11,-6,-1,4,9, 14,19, 24, 29,...} 
{xe Z:-2<x< 7}={-2,-1,0,1,2,3, 4,5, 6} 
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{xe R: x2 = 3} = {-V3, V3} 

{ee R:x7 + 5x = —6} = {-2,-3} 

. {cE R:sinazx = 0} ={...,-2,-1,0,1,2,3,4,...}=Z 
- {xe Z: |x| <5} = {-4,—-3, -2,-1,0,1,2,3, 4} 

- {xeE Z:|6x| < 5} = {0} 
. (5a+2b:a,b€ Z}={...,—-2,—-1,0,1,2,3,...}=Z 

- {2,4,8,16,32,64...} = {2 :xeN} 

. {...,-6,—-3,0,3,6,9, 12, 15,...} = {8x:x€ Z} 

. {0,1,4,9, 16, 25,36,...}={x?:xeZ} 

A Ad ele se as ee ih 


+ We es gry lite a le” eZ) 
© desley I, Z,0, 5,0, 2m at 


-1, lly s 1} 


{it kez} 
- xe Z: |x| < 10}}=19 
» {1}, {2, {3,4}, O}1=1 85. [xe Z:x? <10}/=7 


2 {(x,y):x€ [1,2], ye [1,2]} 


37. [{xeN:x7<0}/=0 


43. {(x,y): |x| =2,y € [0, 1]} 


— 
bo 
wy 


233 


47, {(x,y):x,yER, y>=x?-1} 49. {((x,x+y):xER, ye Z} 
3A 


Section 1.2 
1. Suppose A = {1,2,3,4} and B = {a,c}. 
(a) Ax B= {(1,a),(1,c),(2,a),(2,c),(3,a),(3,c),(4,a),(4,¢)} 
(b) Bx A= {(a,1),(a, 2), (a,3), (a, 4), (c, 1),(c, 2), (c, 3), (ce, 4} 
(c) Ax A= {(1,1),(1,2), (1,3), (1, 4), (2, 1), (2, 2), (2, 3), (2,4), 
(3, 1), (3, 2), (3,3), (3,4), (4, 1), (4, 2), (4,3), (4, 4)} 
(d) Bx B= {(a,a),(a,c),(c,a),(c,c)} 
(e) @xB= {(a,b):a€ ,b € B}=@ (There are no ordered pairs (a,b) with a€ @.) 
(f) (AxB)xB= 
{((1,a),a),((1,¢), a), ((2,a), a), ((2, ¢), a), ((3, a), a), ((3, c),a),((4,a),a),((4,c), a), 
((1,@),c),((1,c),c), (2, a), c), (2, c), c), (3, a), c), (3, €), c), ((4,a), c), ((4, c), €)} 
(g) Ax(BxB)= 
{(1,(a,a)),(1,(a,c)),(1,(e,a)),(1,(c,¢)), 
(2,(a,a)), (2, (a, c)),(2,(c,a)),(2,(e,c)), 
(3,(a,a)), (3, (a, c)), (3, (c,a)), (3, (e,¢)), 
(4,(a,a)),(4,(a,c)),(4,(c,a)),(4,(c, c))} 


(h) B® = {(a,a,a),(a,a,c),(a,c,a),(a,c,c),(c,a,a),(c,a,c),(c,c,a),(c,c, c)} 
8. {xe R:x? =2}x {a,c,e}= {(-V2,a),(V2,a),(-V2,c), (V2, c), (-V2, e), (V2, e)} 
5. {xe Rix? =2} x {xe R: |x| =2}= {(-V2,-2),(V2,2),(-V2,2), (V2, -2)} 
7. {D} x {0,D} x {0,1} = ((G,0,0),(G, 0, 1),(G, 0), (D, D, L)} 
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Sketch the following Cartesian products on the x-y plane. 


9. {1,2,3} x {-1,0,1} 15. {1} x [0,1] 
2 2 
1 e e . 1 
3 -2 -1 1 2 3 3 -2 -1 1 2 3 
-1 . e e -1 
-~2 -2 
11. [0,1] x [0,1] 17. NxZ te gt es 
2 2 ° ° ° 
1 1 ° ° ° 
= eal ee ee ae oe a ee ae 
-1 -1 ° ° ° 
-2 —-2 ° ° ° 
13. {1,1.5, 2} [1,2] 19. [0,1] x [0, 1] x [0, 1] 
LT 
1 
3 -2 -1 1 2 3 
= 
-2 


Section 1.3 
A. List all the subsets of the following sets. 
1. The subsets of {1,2,3,4} are: {}, {1}, {2}, {3}, {4}, {1,2}, {1,3}, {1,4}, {2,3}, {2,4}, 
{3,4}, {1,2,3}, {1,2,4}, {1,3,4}, {2,3,4}, {1,2,3,4}. 
3. The subsets of {{R}} are: {} and {{R}}. 
5. The subsets of {Z} are {} and {d}. 
7. The subsets of {R,{Q,N}} are (}, {(R},{{Q,N}, (R, {Q, N}}. 


B. Write out the following sets by listing their elements between braces. 
9. {X:X <{3,2,a} and |X|=2} = {{8,2},{3,a}, {2,a}} 
11. {X:X<{3,2,a} and |X|=4} =§=@ 


C. Decide if the following statements are true or false. 
13. R° CR is true because any set is a subset of itself. 


15. {(x,y):x-1=0} ¢ {(x, y):x2-x=0}. This is true. (The even-numbered ones 
are both false. You have to explain why.) 
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Section 1.4 
A. Find the indicated sets. 


1. A({{a, b}, {c}}) = (0, {{a, b}, {ich}, {{a, b}, {c}}} 
3. A ({{D},5}) = (0, {Dh}, {5}, ((9},5} 
5. P(A ({2})) = {D, {O}, {{2}}, {D, {2H} 
7. PAl({a,b}) x AO, I) = 
{ (0,9), (D, {0}), (2, {1}), (2, {0, 1}), 
({a},), — ({a}, {0}), fa}, {1}), {a}, {0, 1), 
({b},0), {DB}, {0}), 1B}, {1}, CB}, {0, 13), 
({a,b},0), ({a,b},{0}), ({a,b},{1}), Ca,b},{0,1}) } 
9. A {a,b} x {0}) = {G, {((a, 0)}, (6, 0)}, {(a, 0), (6, O}} 
11. (X c A({1,2,3}): |X| <= 
{D,{O}, {{1}}, 12H}, {3}, (1, 2, (1, 3h, 12, 3}, (1,2, 3}} 


B. Suppose that |A|=m and |B|=n. Find the following cardinalities. 


13. |A AA Ay = 22°”) 

15. |A(A x B)| = 2" 

17. {xX e« AA): |X|< l= m4+1 

19. |AAAA x 6) =|A APA =4 


Section 1.5 


1. Suppose A = {4,3,6,7,1,9}, B = {5,6,8,4} and C = {5,8,4}. Find: 
(a) AUB = {1,3,4,5,6,7,8,9} (f) ANC= {4} 
(b) ANB = {4,6} 
(c) A-B= {3,7,1,9} 
(d) A-—C = {3,6,7,1,9} 
(e) B—A= {5,8} (i) C-B=@ 


(g) BNC = {5,8,4} 
(h) BUC = {5,6,8, 4} 


3. Suppose A = {0,1} and B = {1,2}. Find: 
(a) (Ax B)n(BxB)= (1, 1),(1,2)} 
(b) (Ax B)U(Bx B)= (0,1), (0,2), (1, 1), (1, 2), (2, 1), (2, 2)} 


(c) (Ax B)-(Bx B)= {(0,1),(0,2)} (f) AA)n AB) = {9, {1} 
(d) (AnB)xA= {(1,0),(1, D} (g) P(A)— A(B)= {{0}, (0, 1} 
(e) (AxB)nB=¢ (h) AANB)= {fh}, {lh} 


(i) {%,{(0, D}, (0, 2)}, (A, Dh, (2, 2h, (0, D, (0, 2)}, (0, D, (1, Dt, (0, D, (1, 2)}, (0, 2), 1, Dh, 
{(0, 2), (1, 2)}, (1, 1), (1, 2)}, (0, 2), (1, 1), (1, 2)}, (0, 1), 1, 1), (1, 2)}, (0, 1), (0, 2), (1, 2)}, 
{(0, 1), (0,2), (1, D}, (0, 1), (0, 2), (1, D, (1, 2)}} 


236 Solutions 


5. Sketch the sets X =[1,3]x[1,3] and Y =[2,4]x [2,4] on the plane R?. On separate 
drawings, shade in the sets X UY, XnNY, X-Y and Y—-X. (Hint: X and Y are 
Cartesian products of intervals. You may wish to review how you drew sets 
like [1,3] x [1,3] in the Section 1.2.) 


4 4 4 4 4 
Y-X 
3 3 — 3 
XUY 
2 2 2 2 2 — 
x XnNY x-Y 

1 1 1 1 1 
I is ae Al eel g I sg 
1 2 3 4 12 3 4 12 3 4 12 3 4 12 3 4 


7. Sketch the sets X = {(x,y)€R?:x7+y? <1} and Y = {(x,y)eR?:x=0} on R?. On 
separate drawings, shade in the sets XUY, XNY, X-Y and Y-X. 


2 
Y a 
> < 


> < 
By -2 - lee -2 —1 


9. The first statement is true. (A picture should convince you; draw one if 
necessary.) The second statement is false: Notice for instance that (0.5,0.5) is 
in the right-hand set, but not the left-hand set. 


Section 1.6 
1. Suppose A = {4,3,6,7,1,9} and B = {5,6,8,4} have universal set U = {ne Z:0<n< 10}. 


(a) A= {0,2,5,8, 10} (f) A-B= {4,6} 
(b) B = {0,1,2,3,7,9, 10} (g) A-B= 5,8} 
(ec) ANA=@ i ae 

(d) AUA= (0,1,2,3,4,5,6,7,8,9,10) =U ‘ BOBS eS) 


(e) A-A=A (i) ANB =({0,1,2,3,4,6,7,9, 10} 


3. Sketch the set X =[1,3]x [1,2] on the plane R?. On separate drawings, shade in 
the sets X, and Xn (0,21 x [0,3]). 


3 3 3 

2 2 2 
x 

1 1 1 


x _ < > 

ee 1 | =o a 

-1y Xn ((0,2] x [0,3]) 

5. Sketch the set X = {(x,y)€R?:1<x*+y” <4} on the plane R’. On a separate 
drawing, shade in the set X. 
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ors "6 


 ; a 
Ww, : | A (shaded 


Solution of 1.6, #5. Solution of 1.7, #1. 


Section 1.7 


1. 
3. 


11. 
13. 


Draw a Venn diagram for A. (Solution above right) 
Draw a Venn diagram for (A—B)NC. 
Scratch work is shown on the right. The 
set A-—B is indicated with vertical shading. C C 
The set C is indicated with horizontal shad- 
ing. The intersection of A—B and C is thus ; 
the overlapping region that is shaded with 7 


both vertical and horizontal lines. The final 
answer is drawn on the far right, where the A 
set (A-—B)nC is shaded in gray. 


A 


. Draw Venn diagrams for AU(BNC) and (AUB)n(AUC). Based on your drawings, 


do you think A U(BNC) = (AUB)n(AUC)? 
If you do the drawings carefully, you will find 


C 
that your Venn diagrams are the same for both 
AU(BNC) and (AUB)n(AUC). Each looks as 
illustrated on the right. Based on this, we are 
inclined to say that the equation AU(BNC) = 
(AUB)n(AUC) holds for all sets A, B and C. A 


. Suppose sets A and B are in a universal set U. Draw Venn diagrams for AnB 


and AUB. Based on your drawings, do you think it’s true that ANB = AUB? 


The diagrams for ANB and AUB look exactly 


alike. In either case the diagram is the shaded | 
region illustrated on the right. Thus we would : 
expect that the equation ANB = AUB is true 


for any sets A and B. 


. Draw a Venn diagram for (An B)-C. 


The simplest answer is (BNC)-A. 
One answer is (AUBUC)-(AnNBnC). 
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Section 1.8 


1. 


11. 
13. 


Suppose A, = {a,b,d,e,g,f}, Ag ={a,b,c,d}, A3 = {b,d,a} and Aq = {a,b,h}. 
4 4 


(a) UA; = {a,b,c,d,e,f,g,h} (b) () A; = {a,b} 
i=l i=l 
. For each neéN, let Ap = {0,1,2,3,...,n}. 
(a) UA; = (O}UN (b) ()Ai= {0,1} 
TEN ieN 
~ (a) Uli,i+1) =[1,00) (b) ()li,i+ll=0 
ieN ieN 
. (a) URx[i,i+1]= {(%,y):x,yeR y= U (b) ()Rxfi,i+ll=¢ 
ieN ieN 
(a U X=N (b) (]) X=¢ 
Xe PAIN) Xe AN) 


Yes, this is always true. 
The first is true, the second is false. 


Chapter 2 Exercises 
Section 2.1 


Decide whether or not the following are statements. In the case of a statement, 
say if it is true or false. 


11. 


13. 


15. 


. Every real number is an even integer. (Statement, False) 
. If x and y are real numbers and 5x =5y, then x= y. (Statement, True) 


1 
3 
5. 
7 


Sets Z and N are infinite. (Statement, True) 


. The derivative of any polynomial of degree 5 is a polynomial of degree 6. 


(Statement, False) 


- cos(x)=—-1 


This is not a statement. It is an open sentence because whether it’s true or 
false depends on the value of x. 


The integer x is a multiple of 7. 
This is an open sentence, and not a statement. 


Either x is a multiple of 7, or it is not. 
This is a statement, for the sentence is true no matter what x is. 


In the beginning God created the heaven and the earth. 

This is a statement, for it is either definitely true or definitely false. There is 
some controversy over whether it’s true or false, but no one claims that it is 
neither true nor false. 
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Section 2.2 


Express each statement as one of the forms PAQ, PvQ, or ~P. Be sure to also 
state exactly what statements P and Q stand for. 


1. The number 8 is both even and a power of 2. 
PaAQ 
P: 8 is even 
Q: 8 is a power of 2 


3. xZ#y ~(x=y) (Also ~ P where P:x= y.) 
5. y>x ~(y <x) (Also ~ P where P: y <x.) 
7. The number x equals zero, but the number y does not. 
PA~Q 
P:x=0 
Q:y=0 
9. xe A-B 
(x€ A)A ~ (x € B) 


11. Ac {Xe AN): |X| <oo} 
(A SN)A(JA| <oo). 
13. Human beings want to be good, but not too good, and not quite all the time. 
PA~QA~R 
P.: Human beings want to be good. 
@ : Human beings want to be too good. 
R: Human beings want to be good all the time. 


Section 2.3 
Without changing their meanings, convert each of the following sentences into a 
sentence having the form “If P, then Q.” 


1. A matrix is invertible provided that its determinant is not zero. 
Answer: If a matrix has a determinant not equal to zero, then it is invertible. 
3. For a function to be integrable, it is necessary that it is continuous. 
Answer: If function is integrable, then it is continuous. 
5. An integer is divisible by 8 only if it is divisible by 4. 
Answer: If an integer is divisible by 8, then it is divisible by 4. 
7. A series converges whenever it converges absolutely. 
Answer: If a series converges absolutely, then it converges. 
9. A function is integrable provided the function is continuous. 
Answer: If a function is continuous, then that function is integrable. 
11. You fail only if you stop writing. 
Answer: If you fail, then you have stopped writing. 


13. Whenever people agree with me I feel I must be wrong. 
Answer: If people agree with me, then I feel I must be wrong. 
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Section 2.4 
Without changing their meanings, convert each of the following sentences into a 
sentence having the form “P if and only if Q.” 


1. For a matrix to be invertible, it is necessary and sufficient that its determinant 
is not zero. 
Answer: A matrix is invertible if and only if its determinant is not zero. 

3. If xy =0 then x=0 or y=0, and conversely. 
Answer: xy=0 if and only if x=0 or y=0 

5. For an occurrence to become an adventure, it is necessary and sufficient for 
one to recount it. 
Answer: An occurrence becomes an adventure if and only if one recounts it. 


Section 2.5 
1. Write a truth table for Pv(Q=>R)_ 5. Write a truth table for (PA ~P)vQ 


P\Q/|R|IQ>R | PVQ=>R) P|Q || (Pa~P) || (PA~P)vQ 
T|\|T|T T T T | T F T 
T|\|T\|F F T T|F F F 
T|\|F\|\T T T F\T F T 
T|F\F T T F\F F F 
F\T|T T T 
FITIF F F 7. Write a truth table for (PA ~P)>Q 
cla a = P]Q] (@a~P)] @a~P)>@ 
F\F\F T T 
T | T F T 
3. Write a truth table for ~(P > Q) T|F F T 
F\T F T 
P\|\Q/|P>Q]~@>Q) FIF F T 
T|T T F 
T\|F F T 
F\T T F 
F\F T F 
9. Write a truth table for ~(~PA~Q). 
P|\|Q| ~P | ~Q@ |~Pv~Q]) ~(~Pa~Q) 
T|T F F F T 
T\F F T T F 
F\T T F T F 
F\F T T T F 
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11. Suppose P is false and that the statement (R > S)@(P AQ) is true. Find the 
truth values of R and S. (This can be done without a truth table.) 
Answer: Since P is false, it follows that (P \Q) is false also. But then in order 
for (R > S)@(P AQ) to be true, it must be that (R > S) is false. The only way 
for (R = S) to be false is if} R is true and S is false. 


Section 2.6 
A. Use truth tables to show that the following statements are logically equivalent. 
1. PA(QQVR)=(PAQ)V(PAR) 


av 


PAQvR) | (PAQ)v(PAR) 
T T 


me) 
& 


QvR | P 
T 


© 
v 


ARR SRA 
SS SA] SPA |] > 


WY)", 
PSPS] 
ol ae Bo ae BER a BE a BI») 
Sy) Sy Sy SSA] > 
SS Se) 
PS Se) 


zy 
zy 


Thus since their columns agree, the two statements are logically equivalent. 
3. P>Q=(~P)vQ 


P|Q/|~P | (}@P)vQ/|P>Q 
T|T | F T T 
T|F| F F F 
F\|T | 7 T T 
F\F]T T T 


Thus since their columns agree, the two statements are logically equivalent. 
5. ~(PVQVR) = (~P)A(~Q)A(~ BR) 


P\/Q|R|PvVQvR/|~P|~Q|~R]] ~PvQvVR) | (~P)A~QA~R) 
T|T|T T F F F F F 
T|T|F T F F T F F 
T\|F|T T F T F F F 
T|F\F T F T T F F 
F/T|T T T F F F F 
F\T\F T T F T F F 
FF \|T T T T F F F 
F\F\F F T T T T T 


Thus since their columns agree, the two statements are logically equivalent. 
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7 P>Q = (PA~Q)>(QAr~Q) 


P/Q | ~Q | PA~Q | QA~Q || (PA~Q)>@QA~Q) | P= Q 
aE NG ae F F T T 
T|F | T sy F F F 
Foal Sai 2 F F T T 
F\|F| T F F T T 


Thus since their columns agree, the two statements are logically equivalent. 


B. Decide whether or not the following pairs of statements are logically equivalent. 


9. By DeMorgan’s law, we have ~ (~ Pv ~Q)=~~PA~~Q=PAQ. Thus the 
two statements are logically equivalent. 


11. (~P)A(P>Q) and ~(Q =P) 


P\|Q|~P|P>Q/|Q=>P ](~P)AP=Q) | ~Q=>P) 
T\|T| F T T F F 
T|F| F F T F F 
F\|T| T T F T T 
F\F | T T Pp T F 


The columns for the two statements do not quite agree, thus the two state- 
ments are not logically equivalent. 


Section 2.7 


Write the following as English sentences. Say whether the statements are true 
or false. 


1. 


VxER,x?2>0 

Answer: For every real number x, x? >0. 

Also: For every real number x, it follows that x? > 0. 

Also: The square of any real number is positive. (etc.) 

This statement is FALSE. Reason: 0 is a real number, but it’s not true that 
07 >0. 


- JAER,VxER,ax=x. 


Answer: There exists a real number a for which ax =x for every real number 
x. 
This statement is TRUE. Reason: Consider a = 1. 


. VneN, AX € AN), |X| <n 


Answer: For every natural number n, there is a subset X of N with |X| <n. 
This statement is TRUE. Reason: Suppose neN. Let X =@. Then |X|=0<n. 
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7 VXCN,IneZ,|X|=n 
Answer: For any subset X of N, there exists an integer n for which |X| =n. 
This statement is FALSE. For example, the set X = {2,4,6,8,...} of all even 
natural numbers is infinite, so there does not exist any integer n for which 
|[X|=n. 

9. VneZ,AméeZm=nt+5 
Answer: For every integer n there is another integer m such that m=n+5. 
This statement is TRUE. 


Section 2.9 


Translate each of the following sentences into symbolic logic. 


1. If f is a polynomial and its degree is greater than 2, then /’ is not constant. 
Translation: (P \Q)=>R, where 
P:f is a polynomial, 
Q:f has degree greater than 2, 
R:f' is not constant. 


3. If x is prime then \/x is not a rational number. 
Translation: P >~Q, where 
P:x is prime, 
Q:/x is a rational number. 


5. For every positive number e¢, there is a positive number 6 for which |x-a| <6 
implies |f(x)— f(a)| <e. 
Translation: V ¢€R,é>0,16€R,6 >0,(|x—-a| < 6) => (|f(x)— f(a)| < €) 


7. There exists a real number a for which a+x =x for every real number x. 
Translation: Ja€éR,VxeERat+x=x 


9. If x is a rational number and x 40, then tan(x) is not a rational number. 
Translation: ((x € Q) A (x 4 0)) => (tan(x) € Q) 


11. There is a Providence that protects idiots, drunkards, children and the United 
States of America. 


One translation is as follows. Let R be union of the set of idiots, the set of 
drunkards, the set of children, and the set consisting of the USA. Let P be the 
open sentence P(x): x is a Providence. Let S be the open sentence S(x,y): x 
protects y. Then the translation is 4x,V y¢€ R,P(x)A S(x,y). 


(Notice that, although this is mathematically correct, some humor has been 
lost in the translation.) 


13. Everything is funny as long as it is happening to somebody else. 
Translation: V x, ~ H(x)=> F(x), 
where H(x): x is happening to me, and F(x): x is funny. 
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Section 2.10 


Negate the following sentences. 


1. The number x is positive, but the number y is not positive. 
The “but” can be interpreted as “and.” Using DeMorgan’s law, the negation is: 
The number x is not positive or the number y is positive. 


3. For every prime number p there, is another prime number gq with q > p. 
Negation: There is a prime number p such that for every prime number q, 
q<p. 

Also: There exists a prime number p for which q < p for every prime number q. 
(etc.) 


5. For every positive number e« there is a positive number M for which |f(x)—b| <e 
whenever x > M. 
To negate this, it may be helpful to first write it in symbolic form. The statement 
is Ve € (0,00), 4M € (0,00), (x > M) => (|f(x)- b| < €). 
Working out the negation, we have 


~ (We € (0,00), AM € (0,00), (x > M) > (If (x) - b| < e)) 
de € (0,00), ~ (AM € (0,00), (x > M) => (f(x) - b| < e)) 
de € (0,00), VM € (0,00), ~ ((x > M) > ([f (x) — b| < e)). 


Finally, using the idea from Example 2.14, we can negate the conditional 
statement that appears here to get 


de € (0,00), VM € (0,00), Ax, (x > M)A ~ (If (x) - b| < e)). 


Negation: There exists a positive number e€ with the property that for every 
positive number M, there is a number x for which x>M and |f(x)—b|=e. 


7. I don’t eat anything that has a face. 
Negation: I will eat some things that have a face. 
(Note. If your answer was “I will eat anything that has a face.” then that is 
wrong, both morally and mathematically.) 


9. If sin(x) <0, then it is not the case that O<x<nz. 
Negation: There exists a number x for which sin(x)<0 and 0<x<n7. 


11. You can fool all of the people all of the time. 


There are several ways to negate this, including: 

There is a person that you can’t fool all the time. or 

There is a person x and a time y for which x is not fooled at time y. 
(But Abraham Lincoln said it better.) 
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Chapter 3 Exercises 
Section 3.1 


1. Consider lists made from the letters T, H, E, O, R, Y, with repetition allowed. 


(a) How many length-4 lists are there? Answer: 6-6-6-6= 1296. 

(b) How many length-4 lists are there that begin with T? 
Answer: 1:6-6-6= 216. 

(c) How many length-4 lists are there that do not begin with T? 
Answer: 5-6-6-6= 1080. 


. How many ways can you make a list of length 3 from symbols 4,8,c,pD,£,F if... 
(a) ... repetition is allowed. Answer: 6-6-6= 216. 
(b) ... repetition is not allowed. Answer: 6-5-4= 120. 
(ec) ... repetition is not allowed and the list must contain the letter a. 
Answer: 5:4+5-4+5-4= 60. 
(d) ... repetition is allowed and the list must contain the letter a. 
Answer: 6-6-6—5-5-5= 91. 


(Note: See Example 3.2 if a more detailed explanation is required.) 


. Five cards are dealt off of a standard 52-card deck and lined up in a row. How 
many such line-ups are there in which all five cards are of the same color? (i.e., 
all black or all red.) 

There are 26-25-24-23-22 = 7,893,600 possible black-card line-ups and 26-25-24-23.- 
22 = 7,893,600 possible red-card line-ups, so the answer is 7,893, 600+7, 893,600 = 
15, 787,200. 


. This problems involves 8-digit binary strings such as 10011011 or 00001010. 

(i.e., 8-digit numbers composed of 0’s and 1’s.) 

(a) How many such strings are there? Answer: 2-2-2-2-2-2-2-2-= 256. 

(b) How many such strings end in 0? Answer: 2-2-2-2-2-2-2-1-= 128. 

(c) How many such strings have the property that their second and fourth 
digits are 1’s? Answer: 2-1-2-1-2-2-2-2-= 64. 

(d) How many such strings are such that their second or fourth digits are 1’s? 
Answer: These strings can be divided into four types. Type 1 consists of 
those strings of form «1*0* «*, Type 2 consist of strings of form +0 * 1 «x, 
and Type 8 consists of those of form +*1*1%****. By the multiplication 
principle there are 2° = 64 strings of each type, so there are 3-64=192 
8-digit binary strings whose second or fourth digits are 1’s. 


. This problem concerns 4-letter codes that can be made from the letters of the 
English Alphabet. 


(a) How many such codes can be made? Answer: 26-26-26-26 = 456976 
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(b) How many such codes have no two consecutive letters the same? 
We use the multiplication principle. There are 26 choices for the first letter. 
The second letter can’t be the same as the first letter, so there are only 25 
choices for it. The third letter can’t be the same as the second letter, so there 
are only 25 choices for it. The fourth letter can’t be the same as the third letter, 
so there are only 25 choices for it. Thus there are 26-25-25 -25 = 406,250 
codes with no two consecutive letters the same. 


11. This problem concerns lists of length 6 made from the letters A,B,C\D,E,F,G,H. 

How many such lists are possible if repetition is not allowed and the list 
contains two consecutive vowels? 
Answer: There are just two vowels A and E to choose from. The lists we want 
to make can be divided into five types. They have one of the forms VV * « * «, 
or *VVxx«x*, or **VVx%*, or ** *VVx, or ****VV, where V indicates a 
vowel and « indicates a consonant. By the multiplication principle, there are 
2-1-6-5-4-3 = 720 lists of form VV «« **. In fact, that for the same reason there 
are 720 lists of each form. Thus the answer to the question is 5-720 = 3600 


Section 3.2 


1. Answer n= 14. 5. 0 = Bones! = 120-119 = 14,280. 
3. Answer: 5! = 120. 7. Answer: 5!4! = 2880. 


9. The case x=1 is straightforward. For x =2,3 and 4, use integration by parts. 
For x=, you are on your own. 


Section 3.3 


1. Suppose a set A has 37 elements. How many subsets of A have 10 elements? 
How many subsets have 30 elements? How many have 0 elements? 

Answers: (°4) = 348,330,136; (3°) = 10,295,472; (°7) = 1. 

3. A set X has exactly 56 subsets with 3 elements. What is the cardinality of X? 
The answer will be n, where (3) = 56. After some trial and error, you will 
discover (8) = 56, so |X|=8. 

5. How many 16-digit binary strings contain exactly seven 1’s? 

Answer: Make such a string as follows. Start with a list of 16 blank spots. 
Choose 7 of the blank spots for the 1’s and put 0’s in the other spots. There 
are (16) = 114,400 ways to do this. 

7. |{X € P({0,1,2,3,4,5,6,7,8, 9}) |X| < 43] = (40) + (40) + (2) + (9) = 1410+45+120 = 
176. 

9. This problem concerns lists of length six made from the letters A,B,C,D,E,F, 

without repetition. How many such lists have the property that the D occurs 
before the A? 
Answer: Make such a list as follows. Begin with six blank spaces and select 
two of these spaces. Put the D in the first selected space and the A in the 
second. There are (§) = 15 ways of doing this. For each of these 15 choices 
there are 4! = 24 ways of filling in the remaining spaces. Thus the answer to 
the question is 15 x 24= 360 such lists. 


11. 


13. 
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How many 10-digit integers contain no 0’s and exactly three 6’s? 
Answer: Make such a number as follows: Start with 10 blank spaces and choose 
three of these spaces for the 6’s. There are (1?) = 120 ways of doing this. For 
each of these 120 choices we can fill in the remaining seven blanks with choices 
from the digits 1,2,3,4,5,7,8,9, and there are 8’ to do this. Thus the answer to 
the question is (1?)-8’ = 251,658, 240. 

! 


A ! ! 
Assume n,keN with k=n. Then (7) = Gp = AG Gaia = lara): 


Section 3.4 


1. 


11. 


Write out Row 11 of Pascal’s triangle. 
Answer: 1 11 55 165 330 462 462 330 165 55 11 1 


. Use the binomial theorem to find the coefficient of x® in (x +2). 


Answer: According to the binomial theorem, the coefficient of x®y° in («+ y)!® 
is (13) x8y° = 1287x°y°. Now plug in y= 2 to get the final answer of 41184x°. 


. Use the binomial theorem to show Y7_,(%) = 2”. Hint: Observe that 2” =(1+1)". 


Now use the binomial theorem to work out (x+ y)” and plug in x=1 and y=1. 


. Use the binomial theorem to show 57_, 3° (7) =4”. 


Hint: Observe that 4” =(1+3)". Now look at the hint for the previous problem. 


. Use the binomial theorem to show (§)-(7)+(5)-(3)+(4)-(3) +---£(4) =0. 


Hint: Observe that 0=0” =(1+(-1))”. Now use the binomial theorem. 


Use the binomial theorem to show 9” = Lp_(-? (;)10"-*. 
Hint: Observe that 9” =(10+(-1))”. Now use the binomial theorem. 


13. Assume n > 3. Then (5) = es) - C3.) = Cs) +("5’) Hr Ce) a (3) a (3) Sc tier 
Section 3.5 
1. At a certain university 523 of the seniors are history majors or math majors 


(or both). There are 100 senior math majors, and 33 seniors are majoring in 
both history and math. How many seniors are majoring in history? 

Answer: Let A be the set of senior math majors and B be the set of senior 
history majors. From |AUB|=|A|+|B|—|ANB| we get 523 = 100+ |B|-—33, so 
|B| =523+33-100=456. There are 456 history majors. 


. How many 4-digit positive integers are there that are even or contain no 0’s? 


Answer: Let A be the set of 4-digit even positive integers, and let B be the 
set of 4-digit positive integers that contain no 0’s. We seek |AUB|. By the 
multiplication principle |A| = 9-10-10-5 = 4500. (Note the first digit cannot be 0 
and the last digit must be even.) Also |B| = 9-9-9-9 = 6561. Further, ANB consists 
of all even 4-digit integers that have no 0’s. It follows that |ANB| =9-9-9-4 = 2916. 
Then the answer to our question is |AUB| = |A|+|B|—|AnB| = 4500+6561-2916 = 
8145. 
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5. How many 7-digit binary strings begin in 1 or end in 1 or have exactly four 1’s? 


Answer: Let A be the set of such strings that begin in 1. Let B be the set of such 
strings that end in 1. Let C be the set of such strings that have exactly four 1’s. 
Then the answer to our question is |AUBUC|. Using Equation (3.4) to compute 
this number, we have |AUBUC| = |A|+|B|+|C|-|AnNB|-|ANC|-|BnC|+|ANBnC| = 
26 + 26 + (7) 25 - (§) — (8) + (3) = 64+64+ 35-32-20-20+10= 101. 

. This problem concerns 4-card hands dealt off of a standard 52-card deck. How 
many 4-card hands are there for which all four cards are of the same suit or 
all four cards are red? 

Answer: Let A be the set of 4-card hands for which all four cards are of the 
same suit. Let B be the set of 4-card hands for which all four cards are red. 
Then ANB is the set of 4-card hands for which the four cards are either all 
hearts or all diamonds. The answer to our question is |A UB| = |A|+|B|—|AnB|= 
4 (43) + (26) -2 (43) = 2( 43) + (78) = 1430+ 14950 = 16380. 

. A 4-letter list is made from the letters L,J,S,T,E,D according to the following 
rule: Repetition is allowed, and the first two letters on the list are vowels or 
the list ends in D. 

Answer: Let A be the set of such lists for which the first two letters are 
vowels, so |A| =2-2-6-6=144. Let B be the set of such lists that end in D, so 
|B|=6-6-6-1=216. Then ANB is the set of such lists for which the first two 
entries are vowels and the list ends in D. Thus |AnB|=2-2-6-1= 24. The 
answer to our question is |AUB|=|A|+|B|—|ANB|= 144+216-24= 336. 


Chapter 4 Exercises 


1. If x is an even integer, then x? is even. 


Proof. Suppose x is even. Thus x = 2a for some aé€ Z. 

Consequently x? = (2a)? = 4a? = 2(2a”). 

Therefore x? = 2b, where b is the integer 2a”. 

Thus x? is even by definition of an even number. a 


. If a is an odd integer, then a? +3a+5 is odd. 


Proof. Suppose a is odd. 

Thus a=2c+1 for some integer c, by definition of an odd number. 

Then a? +3a+5=(2c+1)?+3(2c+1)+5= 4c? +4c+1+6c+3+5=4c7+10c+9 

= 4c? +10c+8+1=2(2c? +5e+4)+1. 

This shows a?+3a+5=2b+1, where b =2c?+5c+4€Z. 

Therefore a? +3a+5 is odd. | 


. Suppose x,ye Z. If x is even, then xy is even. 


Proof. Suppose x,y «€ Z and x is even. 

Then x = 2a for some integer a, by definition of an even number. 

Thus xy = (2a)(y) = 2(ay). 

Therefore xy = 2b where 0 is the integer ay, so xy is even. a 


7. 


11. 


13. 
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Suppose a,be Z. If a|b, then a? | b?. 


Proof. Suppose a |b. 

By definition of divisibility, this means 6 =ac for some integer c. 
Squaring both sides of this equation produces b? = a?c?. 

Then b? =a?d, where d=c’€Z. 


By definition of divisibility, this means a? | b?. a 


. Suppose a is an integer. If 7| 4a, then 7|a. 


Proof. Suppose 7 | 4a. 

By definition of divisibility, this means 4a = 7c for some integer c. 

Since 4a = 2(2a) it follows that 4a is even, and since 4a = 7c, we know 7c is even. 
But then c can’t be odd, because that would make 7c odd, not even. 

Thus c is even, so c = 2d for some integer d. 

Now go back to the equation 4a =7c and plug in c = 2d. We get 4a = 14d. 
Dividing both sides by 2 gives 2a = 7d. 

Now, since 2a = 7d, it follows that 7d is even, and thus d cannot be odd. 
Then d is even, so d = 2e for some integer e. 

Plugging d = 2e back into 2a=7d gives 2a = 14e. 

Dividing both sides of 2a = 14e by 2 produces a= 7e. 

Finally, the equation a = 7e means that 7|a, by definition of divisibility. a 


Suppose a,b,c,deé Z. If a|b and c|d, then ac| bd. 


Proof. Suppose a|b and c|d. 

As a|b, the definition of divisibility means there is an integer x for which 
b=ax. 

As c|d, the definition of divisibility means there is an integer y for which 
d=cy. 

Since 6 =ax, we can multiply one side of d=cy by 6b and the other by ax. 
This gives bd =axcy, or bd =(ac)(xy). 

Since xy € Z, the definition of divisibility applied to bd = (ac)(xy) givesac|bd. 


Suppose x, yeER. If x7+5y=y2+5x, then x=y or x+y=5. 


Proof. Suppose x? +5y = y?+5x. 

Then x? -— y2 =5x—5y, and factoring gives (x— y\(x+y) =5(x—y). 

Now consider two cases. 

Case 1. If x—y #0 we can divide both sides of (x — y)(x+ y) = 5(x— y) by the 
non-zero quantity x—y to get x+ y=5. 

Case 2. If x— y=0, then x= y. (By adding y to both sides.) 

Thus x=y or x+y=5. a 
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15. 


17. 


19. 


21. 


23. 


If ne Z, then n?+3n+4 is even. 


Proof. Suppose ne Z. We consider two cases. 

Case 1. Suppose n is even. Then n = 2a for some ae Z. 

Therefore n? +3n +4 = (2a)? + 3(2a)+4 = 4a? + 6a + 4 = 2(2a? + 3a +2). 

So n?+3n+4=2b where b = 2a2+3a+2€Z, so n*+3n+4 is even. 

Case 2. Suppose n is odd. Then n= 2a+1 for some ae Z. 

Therefore n?7+3n+4=(2a+1)?+3(2a+1)+4=4a7+4a+1+6a+3+4=4a7+10a+8 
= 2(2a7 +5a+4). So n2+3n+4=2b where b = 2a72+5a+4€ Z, so n2+3n+4 is even. 


In either case n?+3n+4 is even. a 
If two integers have opposite parity, then their product is even. 


Proof. Suppose a and 6 are two integers with opposite parity. Thus one is even 
and the other is odd. Without loss of generality, suppose a is even and b is 
odd. Therefore there are integers c and d for which a = 2c and b=2d+1. Then 
the product of a and 6b is ab = 2c(2d+1)=2(2cd +c). Therefore ab = 2k where 
k=2cd+ceZ. Therefore the product ab is even. a 


Suppose a,b,c € Z. If a?|b and b®|c then a® | c. 


Proof. Since a?|b we have b = ka? for some k € Z. Since b? | c we have c= hb? 
for some hf Z. Thus c= h(ka?)? = hk?a®. Hence a® | c. a 


If p is prime and 0<k<p then p|(?). 


Proof. From the formula (?) = ata: we get p! =(?)(p—k)!k!. Now, since the 
prime number p is a factor of p! on the left, it must also be a factor of (7)(p—k)!k! 
on the right. Thus the prime number p appears in the prime factorization of 
(2)(p —k)IR!. 

Now, k! is a product of numbers smaller than p, so its prime factorization 
contains no p’s. Similarly the prime factorization of (p—k)! contains no p’s. 
But we noted that the prime factorization of (2)(p —k)!k! must contain a p, so it 
follows that the prime factorization of (2) contains a p. Thus (?) is a multiple 
of p, so p divides (?). | 


If n€N then (?”) is even. 


Proof. By definition, (=) is the number of n-element subsets of a set A with 2n 
elements. For each subset X ¢ A with |X|=n, the complement X is a different 
set, but it also has 2n—n=n elements. Imagine listing out all the n-elements 
subset of a set A. It could be done in such a way that the list has form 


X1,X1, X2,Xeo, X3,X3, X4,X4, X5, X5... 


This list has an even number of items, for they are grouped in pairs. Thus (24) 
is even. 


25. 


27. 
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a\(b a —b+c 
Ifa,b,ceN and c<b<a then (()(C)=(,7 )( 2"). 
Proof. Assume a,b,c €N with c<b <a. Then we have () = aie oot = 
a! (a—b+c)! _ a! (a—b+c)! _ ( a eS) a 
(a—b+c)'(a—b)! (b-c)!c! ~ (b-c)a—-b+c)! (a—b)!c! ~ \b-c Cc : 
Suppose a,beN. If ged(a,b)>1 then, a|b or 6 is not prime. 


Proof. Suppose gcd(a,b)>1, and let c = gcd(a,b) >1. Then since c is a divisor of 
both a and b, we have a=cx and b=cy for integers x and y. We divide into 
two cases according to whether or not b is prime. 

Case I. Suppose b is prime. Then the above equation b = cy with c>1 forces 
c=6 and y=1. Then a=cx becomes a = bx, which means a|b. We conclude 
that the statement “a|b or b is not prime,” is true. 

Case II. Suppose b is not prime. Then the statement “a|b or 6 is not prime,’ 
is automatically true. a 


9 


Chapter 5 Exercises 


1. 


Proposition Suppose né Z. If n? is even, then n is even. 


Proof. (Contrapositive) Suppose n is not even. Then n is odd, so n= 2a+1 for 
some integer a, by definition of an odd number. Thus n? = (2a+1)? = 4a7+4a+1= 
2(2a? +2a)+1. Consequently n? = 2b+1, where 6 is the integer 2a7+2a, so n? is 
odd. Therefore n? is not even. a 


. Proposition Suppose a,be¢ Z. If a2(b?— 2b) is odd, then a and b are odd. 


Proof. (Contrapositive) Suppose it is not the case that a and b are odd. Then, 
by DeMorgan’s law, at least one of a and b is even. Let us look at these cases 
separately. 

Case 1. Suppose a is even. Then a =2c for some integer c. Thus a?2(b? — 26) 
= (2c)?(b? — 2b) = 2(2c?(b? — 2b)), which is even. 

Case 2. Suppose b is even. Then b = 2c for some integer c. Thus a2(b? - 26) 
= a?((2c)? — 2(2c)) = 2(a?(2c? — 2c)), which is even. 

(A third case involving a and 6 both even is unnecessary, for either of the two 
cases above cover this case.) Thus in either case a?(b? — 2b) is even, so it is not 
odd. a 


. Proposition Suppose x¢R. If «7+5x«<0 then x<0. 


Proof. (Contrapositive) Suppose it is not the case that x <0, sox=0. Then 
neither x? nor 5x is negative, so x7+5x>0. Thus it is not true that x7+5«<0. 
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11. 


13. 


Proposition Suppose a,be Z. If both ab and a+b are even, then both a and 
b are even. 


Proof. (Contrapositive) Suppose it is not the case that both a and b are even. 
Then at least one of them is odd. There are three cases to consider. 

Case 1. Suppose a is even and 6 is odd. Then there are integers c and 
d for which a = 2c and b=2d+1. Then ab = 2c(2d+1), which is even; and 
a+b=2c+2d+1= 2(c+d)+1, which is odd. Thus it is not the case that both 
ab and a+b are even. 

Case 2. Suppose a is odd and 6b is even. Then there are integers c and d for 
which a = 2c+1 and 6=2d. Then ab = (2c + 1)(2d) = 2(d(2c + 1)), which is even; 
and a+ b=2c+1+2d = 2(c+d)+1, which is odd. Thus it is not the case that 
both ab and a+6 are even. 

Case 3. Suppose a is odd and b is odd. Then there are integers c and d for 
which a = 2c+1 and b=2d+1. Then ab = (2c+1)(2d+1)= 4cd+2c+2d+1= 
2(2cd +c +d)+1, which is odd; and a+b =2c+1+2d+1=2(c+d+1), which is 
even. Thus it is not the case that both ab and a+b are even. 

These cases show that it is not the case that ab and a+b are both even. (Note 
that unlike Exercise 3 above, we really did need all three cases here, for each 
case involved specific parities for both a and b.) a 


. Proposition Suppose ne Z. If 3{n?, then 3}n. 


Proof: (Contrapositive) Suppose it is not the case that 3{n, so 3|n. This means 
that n = 3a for some integer a. Consequently n? = 9a?, from which we get 
n? = 3(3a). This shows that there in an integer b = 3a? for which n? = 3b, which 
means 3|n?. Therefore it is not the case that 3{n?. a 


Proposition Suppose x,y¢Z. If x*(y+3) is even, then «x is even or y is odd. 


Proof. (Contrapositive) Suppose it is not the case that x is even or y is odd. 
Using DeMorgan’s law, this means x is not even and y is not odd, which is to 
say x is odd and y is even. Thus there are integers a and b for which x =2a+1 
and y = 2b. Consequently x?(y +3) = (2a + 1)7(2b +3) = (4a? + 4a + 1)(2b +3) = 
8a7b + 8ab + 2b + 12a? + 12a +3 = 8a7b + 8ab + 2b + 1207+ 12a+24+1= 

2(4a2b + 4ab+b+6a?+6a+1)+1. This shows x?(y+3)=2c+1 for c=4a2b+4ab+ 
b+6a2+6a+1¢€Z. Consequently, x?(y +3) is not even. a 


Proposition Suppose x¢R. If «°+7x?+5x>x4+x?4+8, then x>0. 


Proof. (Contrapositive) Suppose it is not true that x=>0. Then x <0, that is 
x is negative. Consequently, the expressions x°, 7x® and 5x are all negative 
(note the odd powers) so x° + 7x?+5x <0. Similarly the terms x‘, x?, and 8 
are all positive (note the even powers), so 0<x*+x2+8. From this we get 
x + 7x3 + 5x <x4+x2+8, so it is not true that x°+ 7x? +5x>x4+x7+8. a 


15. 
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Proposition Suppose x¢ Z. If x?—1 is even, then x is odd. 


Proof. (Contrapositive) Suppose x is not odd. Thus x is even, so x = 2a for some 
integer a. Then x? —1= (2a)? -1 = 8a? -1=8a?-2+1 =2(4a?—-1)+1. Therefore 
x? —1=2b+1 where b = 4a°-1€Z, so x®-1 is odd. Thus x?-1lis not even. 


Proposition If n is odd, then 8 |(n?-1). 


Proof. (Direct) Suppose n is odd, so n = 2a+1 for some integer a. Then n?-1= 
(2a+1)?-1= 4a?+4a = 4(a?+a)= 4a(a+1). So far we have n?-1=4a(a+1), but 
we want a factor of 8, not 4. But notice that one of a or a+1 must be even, so 
a(a+1) is even and hence a(a+1)=2c for some integer c. Now we have n?-1= 
4a(a+1)= 4(2c)=8c. But n?-1=8c means 8|(n?—-1). a 


Proposition Let a,b¢Z and neN. If a=b (mod n) and a=c (mod n), then 
c=b (mod n). 


Proof. (Direct) Suppose a= b (mod n) and a=c (mod n). 

This means n|(a—b) and n|(a-c). 

Thus there are integers d and e for which a— b=nd and a-c=ne. 
Subtracting the second equation from the first gives c-b=nd-—ne. 

Thus c—b=n(d-e), so n|(c—b) by definition of divisibility. 

Therefore c=b (mod n) by definition of congruence modulo n. a 


Proposition Let a,b¢Z and néeN. If a=b (mod n), then a? = b? (mod n). 


Proof. (Direct) Suppose a= b (mod n). This means n|(a— 6), so there is an 
integer c for which a—6b=nc. Then: 


a-b = ne 
(a—b\a?+ab+b") = ne(a?+ab+b?) 
a® +a7b+ab*—-ba*-ab*-b® = nc(a2+ab+b?) 
a®-b? = nc(a?+ab+b?). 


Since a? +ab+b?€Z, the equation a°— b? = nc(a2 +ab + 6”) implies n | (a? — b°), 


and therefore a? = 6° (mod n). a 
Proposition Let a,b,ceZ and neN. If a=b (mod n), then ca=cb (mod n). 


Proof. (Direct) Suppose a= b (mod n). This means n|(a—6), so there is an 
integer d for which a—b = nd. Multiply both sides of this by c to get ac—be =ndc. 
Consequently, there is an integer e = dc for which ac-— bc = ne, so n|(ac— bc) 
and consequently ac = bc (mod n). a 


If neN and 2” —1 is prime, then n is prime. 


Proof. Assume n is not prime. Write n=ab for some a,b>1. Then 2”-1= 
gv 1 = (2>—1)(27-6 4 226-26 5 gab-3b 4... 490-46) Hence 2”-1is composite. 
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27. If a=0 (mod 4) or a=1 (mod 4) then (§) is even. 


29. 


31. 


Proof. We prove this directly. Assume a = 0 (mod 4). Then (6) = Hen) Since 
a=4k for some k €N, we have ($) = “4#-» = 22(4k - 1). Hence ($) is even. 
Now assume a = 1 (mod 4). Then a = 4k +1 for some k EN. Hence ($) = Aer Di) = 


2k(4k + 1). Hence, (§) is even. This proves the result. a 
If integers a and 6 are not both zero, then ged(a, b) = ged(a — b, b). 


Proof. (Direct) Suppose integers a and bd are not both zero. Let d = ged(a,b). 
Because d is a divisor of both a and 6b, we have a=dx and b =dy for some 
integers x and y. Then a—~b=dx-dy=d(x-,y), so it follows that d is also a 
common divisor of a—6 and 6. Therefore it can’t be greater than the greatest 
common divisor of a—6 and b, which is to say ged(a,b)=d < gcd(a— 8, b). 


Now let e=gced(a—b,b). Then e divides both a-6 and 5, that is, a-b=ex and 
b =ey for integers x and y. Then a=(a—b)+b=ex+ey=e(x+7), SO NOW we see 
that e is a divisor of both a and b. Thus it is not more than their greatest 
common divisor, that is, gcd(a — b,b) =e < gcd(a,b). 


The above two paragraphs have given gcd(a,b) < gcd(a — 6,b) and ged(a — 6,6) < 
gcd(a,b). Thus ged(a, b) = ged(a — b, b). 
|_| 


Suppose the division algorithm applied to a and b yields a=qb+r. Then 
gcd(a, b) = ged(r, b). 


Proof. Suppose a=qb+r. Let d=gcd(a,b), so e is a common divisor of a and b; 
thus a = dx and b =dy for some integers x and y. Then dx =a=qb+r=qdy+r, 
hence dx =qdy+r, and so r=dx-—qdy =d(x—qy). Thus d is a divisor of r (and 
also of b), so gcd(a,b) = d < gced(r,b). 


On the other hand, let e = ged(r,b), so r=ex and b= ey for some integers x and 
y. Then a=qb+r=qey+ex=e(qy+x). Hence e is a divisor of a (and of course 
also of b) so ged(r,b) =e < gced(a, b). 


We’ve now shown ged(a,b) < ged(r, b) and ged(r, b) < ged(a, b), so ged(r, b) = ged(a, b). 
| 


Chapter 6 Exercises 


1. 


Suppose n is an integer. If n is odd, then n? is odd. 


Proof. Suppose for the sake of contradiction that n is odd and n? is not odd. 
Then n? is even. Now, since n is odd, we have n = 2a+1 for some integer a. 
Thus n? = (2a +1)? = 4a7+4a+1= 2(2a7+2a)+1. This shows n? = 2b+1, where 
b is the integer b = 2a2+2a. Therefore we have n? is odd and n? is even, a 
contradiction. | 
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3. Prove that <2 is irrational. 


Proof. Suppose for the sake of contradiction that \/2 is not irrational. Therefore 
it is rational, so there exist integers a and b for which /2= 5. Let us assume 
that this fraction is reduced, so a and b are not both even. Now we have 
ee (2)°, which gives 2 = a, or 2b°=a?. From this we see that a® is even, 
from which we deduce that a is even. (For if a were odd, then a® = (2c +1)? = 
8c? + 12c? +6c+1= 2(4c? + 6c? +3c)+1 would be odd, not even.) Since a is even, 
it follows that a = 2d for some integer d. The equation 2b? = a® from above then 
becomes 2b° = (2d)°, or 2b° = 8d°. Dividing by 2, we get b? = 4d?, and it follows 
that b? is even. Thus b is even also. (Using the same argument we used when 
a® was even.) At this point we have discovered that both a and 6 are even, 


contradicting the fact (observed above) that the a and b are not both even. @& 
Here is an alternative proof. 


Proof. Suppose for the sake of contradiction that 4/2 is not irrational. Therefore 
there exist integers a and 6 for which ¥/2= 5: Cubing both sides, we get 2= ee 
From this, a? = b°? + 5°, which contradicts Fermat’s last theorem. | 


5. Prove that V3 is irrational. 


Proof. Suppose for the sake of contradiction that V3 is not irrational. Therefore 
it is rational, so there exist integers a and b for which V3 = 5: Let us assume 
that this fraction is reduced, so a and b have no common factor. Notice that 
3 (2)’, so 3= Lp or 3b? =a”. This means 3|a?. 


Now we are going to show that if ae Z and 3|a?, then 3|a. (This is a proof- 
within-a-proof.) We will use contrapositive proof to prove this conditional 
statement. Suppose 3{a. Then there is a remainder of either 1 or 2 when 3 is 
divided into a. 

Case 1. There is a remainder of 1 when 3 is divided into a. Then a=3m+1 
for some integer m. Consequently, a? = 9m?2+6m+1= 3(3m?+2m)+1, and this 
means 8 divides into a? with a remainder of 1. Thus 3{a?. 

Case 2. There is a remainder of 2 when 3 is divided into a. Then a=3m+2 
for some integer m. Consequently, a2 = 9m?2+12m+4= 9m2+12m+3+1= 
3(83m2+4m+1)+1, and this means 3 divides into a? with a remainder of 1. Thus 
3{a?. 

In either case we have 3/a”, so we’ve shown 3}a implies 3{a?. Therefore, if 
3|a?, then 3|a. 

Now go back to 3|a? in the first paragraph. This combined with the result of 
the second paragraph implies 3|a, so a= 3d for some integer d. Now also in the 
first paragraph we had 3b? = a”, which now becomes 3b? = (3d)? or 3b? = 9d?, so 
b? = 3d?. But this means 3|b?, and the second paragraph implies 3|b. Thus 
we have concluded that 3|a and 3| 5, but this contradicts the fact that the 
fraction ¢ is reduced. a 
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11. 


13. 


15. 


. If a,be€Z, then a*-4b-30. 


Proof. Suppose for the sake of contradiction that a,b € Z but a?—4b-3=0. Then 
we have a? = 4b +3 = 2(2b+1)+1, which means a? is odd. Therefore a is odd also, 
so a=2c+1 for some integer c. Plugging this back into a? - 4b-3=0 gives us 


(2c+1)?-4b-3 = 
4c? +4c+1-4b-3 = 
4c? +4c-4b = 
2c7+2c-2b = 
Acr2+ce-b) = 


an 


From this last equation, we see that 1 is an even number, a contradiction. 


. Suppose a,be€R anda#0. Ifa is rational and ab is irrational, then 6 is 


irrational. 


Proof. Suppose for the sake of contradiction that a is rational and ab is irra- 
tional and 6b is not irrational. Thus we have a and b rational, and ab irrational. 
Since a and b are rational, we know there are integers c,d,e,f for which a= $ 
and b= Q Then ab = ap and since both ce and df are integers, it follows 
that ab is rational. But this is a contradiction because we started out with ab 
irrational. a 


There exist no integers a and 6 for which 18a + 6b = 1. 


Proof. Suppose for the sake of contradiction that there do exist integers a 
and b for which 18a+6b=1. Then 1 =2(9a+3b), which means 1 is even, a 
contradiction. a 


For every x € [z/2,7], sinx—cosx=>1. 


Proof. Suppose for the sake of contradiction that x € [7/2,7], but sinx—cosx <1. 
Since x € [7/2,7], we know sinx =0 and cosx <0, so sinx—cosx=0. Therefore 
we have 0 < sinx—cosx <1. Now the square of any number between 0 and 
1 is still a number between 0 and 1, so we have 0 <(sinx—cosx)? <1, or 0< 
sin? x —2sinxcosx+cos?x <1. Using the fact that sin? x+cos?x = 1, this becomes 
0<-2sinxcosx+1<1. Subtracting 1, we obtain —2sinxcosx <0. But above we 
remarked that sinx =0 and cosx <0, and hence —2sinxcosx = 0. We now have 
the contradiction —2sinxcosx <0 and —2sinxcosx = 0. | 


If be Z and b{k for every keEN, then b=0. 


Proof. Suppose for the sake of contradiction that b€ Z and b{k for every kEN, 
but b 40. 

Case 1. Suppose b>0. Then bEN, so b|d, contradicting b{k for every keEN. 
Case 2. Suppose b <0. Then —beEN, so b|(—b), again a contradiction a 
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17. For every ne€ Z, 4{(n? +2). 


19. 


21. 


Proof. Assume there exists n € Z with 4|(n?+2). Then for some k € Z, 4k =n? +2 
or 2k =n*+2(1-k). If n is odd, this means 2k is odd, and we’ve reached a 
contradiction. If n is even then n =2j and we get k =2j2+1-k for some je Z. 
Hence 2(k — ga) =1, so 1 is even, a contradiction. | 


Remark. It is fairly easy to see that two more than a perfect square is always 
either 2 (mod 4) or 3 (mod 4). This would end the proof immediately. 


The product of 5 consecutive integers is a multiple of 120. 


Proof. Given any collection of 5 consecutive integers, at least one must be a 
multiple of two, at least one must be a multiple of three, at least one must be 
a multiple of four and at least one must be a multiple of 5. Hence the product 
is a multiple of 5-4-3-2=120. In particular, the product is a multiple of 60. 


Hints for Exercises 20-23. For Exercises 20, first show that the equation 
a” +b? = 3c? has no solutions (other than the trivial solution (a,b,c) = (0,0,0)) 
in the integers. To do this, investigate the remainders of a sum of squares 

(mod 4). After you’ve done this, prove that the only solution is indeed the trivial 
solution. 

Now, assume that the equation x? + y?—3=0 has a rational solution. Use the 
definition of rational numbers to yield a contradiction. 


Chapter 7 Exercises 


1. 


. Given an integer a, then a 


Suppose xe Z. Then x is even if and only if 3x +5 is odd. 


Proof. We first use direct proof to show that if x is even, then 3x+5 is odd. 
Suppose x is even. Then x= 2n for some integer n. Thus 3x+5= 3(2n)+5= 
6n+5=6n+4+1= 2(8n+2)+1. Thus 3x+5 is odd because it has form 2k +1, 
where k = 3n+2€ Z. 


Conversely, we need to show that if 3x+5 is odd, then x is even. We will 
prove this using contrapositive proof. Suppose x is not even. Then x is odd, so 
x=2n+1 for some integer n. Thus 3x+5= 3(2n+1)+5= 6n+8=2(8n+4). This 
means says 3x+5 is twice the integer 3n+4, so 3x+5 is even, not odd. a 


3 2 


+a“+a is even if and only if a is even. 


Proof. First we will prove that if a? +a?+a is even then a is even. This is done 


with contrapositive proof. Suppose a is not even. Then a is odd, so there is an 
integer n for which a=2n+1. Then 


a®+a7+a = (2nt+1)?+(Qn4+1)?+(Qn+1) 
= 8n°+12n?7+6n+1+4n?+4n+1+2n+4+1 
= 8n°+16n7+12n+2+1 
= 2(4n?+8n?+6n+1)+1. 
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3 2 3 2 


+a as twice an integer plus 1, so a’? +a“+a is odd, not 
24a is even then a is even. 


This expresses a’ +a 
even. We have now shown that if a? +a 


2+ais even. We will use 


Conversely, we need to show that if a is even, then a?+a 
direct proof. Suppose a is even, so a = 2n for some integer n. Then a?+a?+a= 
(2n)? + (2n)? +2n = 8n? + 4n?2+2n = 2(4n?+2n2+n). Therefore, a?+a?2+a is even 


because it’s twice an integer. a 


5. An integer a is odd if and only if a® is odd. 


Proof. Suppose that a is odd. Then a =2n+1 for some integer n, and a? = 
(2n +1)? = 8n?+12n?+6n+1= 2(4n?+6n2+3n)+1. This shows that a° is twice 
an integer, plus 1, so a® is odd. Thus we’ve proved that if a is odd then a? is 
odd. 


Conversely we need to show that if a® is odd, then a is odd. For this we employ 
contrapositive proof. Suppose a is not odd. Thus a is even, so a = 2n for some 
integer n. Then a® = (2n)? = 8n = 2(4n°) is even (not odd). a 


7. Suppose x,yeR. Then (x+y)? =x?+y? if and only if x=0 or y=0. 


Proof. First we prove with direct proof that if (x+y)? =x?+y?, then x=0 or 
y=0. Suppose (x+y)? =x?+y7. From this we get x7+2xy+y? =x?+y?, so 2xy =0, 
and hence xy=0. Thus x=0 or y=0. 


Conversely, we need to show that if x =0 or y=0, then (x+y)? =x?+y?. This 
will be done with cases. 

Case 1. If x =0 then (x+y)? =(0+ y)? = y? =07+ y2 =x7 4 y?. 

Case 2. If y=0 then (x+y)? =(«+ 0)? =x? =x? +0? =x7 + y?. 

Either way, we have (x+y)? =x?+ y?. | 


9. Suppose ae Z. Prove that 14|a if and only if 7|a and 2|a. 


Proof. First we prove that if 14|a, then 7|a and 2|a. Direct proof is used. 
Suppose 14|a. This means a= 14m for some integer m. Therefore a = 7(2m), 
which means 7|a, and also a = 2(7m), which means 2|a. Thus 7|a and 2|a. 


Conversely, we need to prove that if 7|a and 2|a, then 14|a. Once again direct 
proof if used. Suppose 7|a and 2|a. Since 2|a it follows that a =2m for some 
integer m, and that in turn implies that a is even. Since 7|a it follows that 
a=‘n for some integer n. Now, since a is known to be even, and a =7n, it 
follows that n is even (if it were odd, then a = 7n would be odd). Thus n = 2p for 
an appropriate integer p, and plugging n = 2p back into a=7n gives a=7(2p), 
so a=14p. Therefore 14|a. B 


11. 


13. 


15. 


17. 
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Suppose a,b€ Z. Prove that (a—3)b? is even if and only if a is odd or b is even. 


Proof. First we will prove that if (a—3)b? is even, then a is odd or b is even. 
For this we use contrapositive proof. Suppose it is not the case that a is 
odd or b is even. Then by DeMorgan’s law, a is even and bd is odd. Thus 
there are integers m and n for which a = 2m and b = 2n+1. Now observe 
(a—3)b? = (2m—3)(2n+ 1)? = (2m-3)(4n?+4n+1) = 8mn?4+8mn+2m-12n?-12n-3 = 
8mn? + 8mn + 2m - 12n? -12n-4+4+1= 24mn2 + 4mn+m-6n?2 -6n-2)+1. This 
shows (a —3)b? is odd, so it’s not even. 


Conversely, we need to show that if a is odd or b is even, then (a —3)b? is even. 
For this we use direct proof, with cases. 

Case 1. Suppose a is odd. Then a = 2m+1 for some integer m. Thus (a—3)b? = 
(2m +1-3)b? = (2m —2)b? = 2(m—1)b?. Thus in this case (a —3)b? is even. 

Case 2. Suppose b is even. Then 6 = 2n for some integer n. Thus (a —3)b? = 
(a — 3)(2n)” = (a—3)4n? = 2(a—3)2n? =. Thus in this case (a —3)b? is even. 
Therefore, in any event, (a—3)b? is even. | 


Suppose a,be Z. If a+b is odd, then a? +b? is odd. 
Hint: Use direct proof. Suppose a+b is odd. Argue that this means a and b 
have opposite parity. Then use cases. 


Suppose a,be Z. Prove that a+b is even if and only if a and b have the same 
parity. 


Proof. First we will show that if a+6 is even, then a and b have the same parity. 
For this we use contrapositive proof. Suppose it is not the case that a and b 
have the same parity. Then one of a and b is even and the other is odd. Without 
loss of generality, let’s say that a is even and b is odd. Thus there are integers 
m and n for which a =2m and 6=2n+1. Then a+b=2m+2n+1=2%m+n)+1, 
so a+b is odd, not even. 


Conversely, we need to show that if a and b have the same parity, then a+b is 
even. For this, we use direct proof with cases. Suppose a and b have the same 
parity. 

Case 1. Both a and 6 are even. Then there are integers m and n for which 
a=2m and b=2n, soa+b=2m+2n=2(m+n) is clearly even. 

Case 2. Both a and 6 are odd. Then there are integers m and n for which 
a=2m+1 and b=2n+1, soa+b=2m+1+2n+1=2(m+n+1) is clearly even. 
Either way, a+b is even. This completes the proof. a 


There is a prime number between 90 and 100. 


Proof. Simply observe that 97 is prime. a 
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19. 


21. 


23. 


25. 


If ne N, then 2°+21+27+423+244...42% =9"+1_1, 


Proof. We use direct proof. Suppose neN. Let S be the number 

S209 014 924.09 pote or +4 oF. (1) 
In what follows, we will solve for S and show S = 2”*!-1. Multiplying both 
sides of (1) by 2 gives 

2S = 21497493 494495 4...497 49041, (2) 
Now subtract Equation (1) from Equation (2) to obtain 2S —S = —2°+2”+1, 
which simplifies to S = 2”*1-—1. Combining this with Equation (1) produces 
29491492493 4944...42"=2"+1_1, so the proof is complete. | 


Every real solution of x°+«+3=0 is irrational. 


Proof. Suppose for the sake of contradiction that this polynomial has a rational 
solution ¢. We may assume that this fraction is fully reduced, so a and b are 
not both even. Observe that we have Gs +£+3=0. Clearing the denominator 
gives 

a®+ab?+3b? =0. 


Consider two cases: First, if both a and b are odd, the left-hand side is a sum 
of three odds, which is odd, meaning 0 is odd, a contradiction. Second, if one 
of a and b is odd and the other is even, then the middle term of a® +ab?+3b° 
is even, while a® and 3b? have opposite parity. Then a®+ab?+3b° is the sum 
of two evens and an odd, which is odd, again contradicting the fact that 0 is 
even. | 


Suppose a,b and c are integers. If a|b and a|(b?—c), then a|c. 


Proof. (Direct) Suppose a |b and a|(b2-—c). This means that 6 = ad and 
b?-—c=ae for some integers d and e. Squaring the first equation produces 
b? =a?d?. Subtracting b?-—c=ae from b? = a?d? gives c = a?d? —ae = a(ad? —e). 
As ad?-e€Z, it follows that a|c. | 


If p >1 is an integer and n{p for each integer n for which 2<n< ,/p, then p is 
prime. 


Proof. (Contrapositive) Suppose that p is not prime, so it factors as p= mn for 
l<m,n<p. 


Observe that it is not the case that both m > \/p and n > \/p, because if this were 
true the inequalities would multiply to give mn > \/p,/p = p, which contradicts 
p=mn. 


Therefore m < \/p or n<,/p. Without loss of generality, say n < ,/p. Then the 
equation p = mn gives n| p, with 1<n<,/p. Therefore it is not true that n{p 
for each integer n for which 2<n< \,/p. a 


27. 


29. 


31. 


33. 


35. 
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Suppose a,be Z. If a?7+b? is a perfect square, then a and 6b are not both odd. 


Proof. (Contradiction) Suppose a? +b? is a perfect square, and a and b are both 
odd. As a?+b? is a perfect square, say c is the integer for which c? =a?+6?. As 
a and 6 are odd, we have a=2m+1 and b=2n+1 for integers m and n. Then 


c2 =a7+b2 =(2m4+1)2+(2n4 1)? =4(m2 4+ n2+mn)+2. 


This is even, so c is even also; let c=2k. Now the above equation results in 
(2k)? = 4(m?2 +n? +mn)+2, which simplifies to 2k? = 2(m?+n?+mn)+1. Thus 2k? 
is both even and odd, a contradiction. | 


If a| bc and ged(a,b)=1, then a|c. 


Proof. (Direct) Suppose a | bc and gcd(a,b)=1. The fact that a | bc means bc = az 
for some integer z. The fact that gcd(a,b)=1 means that ax+by=1 for some 
integers x and y (by the proposition on page 124). From this we get acx+bcy=c; 
substituting bc = az yields acx+azy=c, that is, a(cex+zy)=c. Thereforea|c. 


If ne Z, then ged(n,n+1)=1. 


Proof. Suppose d is a positive integer that is a common divisor of n and n+1. 
Then n=dx and n+1=dy for integers x and y. Then 1=(n+1)-n=dy-—dx= 
d(y—x). Now, 1=d(y-x) is only possible of d=+1 and y—-x=+1. Thus the 
greatest common divisor of n and n+1 can be no greater than 1. But 1 does 
divide both n and n+1, so ged(n,n+1)=1. | 


If ne Z, then ged(2n + 1,4n?+1)=1. 


Proof. Note that 4n?2+1=(2n+1)(2n—1)+2. Therefore, it suffices to show that 
gcd(2n + 1,(2n + 1)(2n—-1)+2)=1. Let d be a common positive divisor of both 
2n+1 and (2n+1)(2n-—1)+2, so 2n+1=dx and (2n+1)(2n-—1)+2=dy for integers 
x and y. Substituting the first equation into the second gives dx(2n—1)+2=dy, 
so 2=dy—dx(2n-1)=d(y—-2nx-—x). This means d divides 2, so d equals 1 or 
2. But the equation 2n + 1=dx means d must be odd. Therefore d = 1, that is, 
gcd(2n + 1,(2n + 1)(2n -1)+2)=1. | 


Suppose a,beN. Then a = gced(a,b) if and only if a| b. 


Proof. Suppose a = gcd(a,b). This means a is a divisor of both a and b. In 
particular a|b. 


Conversely, suppose a|b. Then a divides both a and b, so a<gcd(a,b). On the 
other hand, since ged(a,b) divides a, we have a = gcd(a,b)-x for some integer x. 
As all integers involved are positive, it follows that a= gcd(a,b). 


It has been established that a < gcd(a,b) and a= gcd(a,b). Thus a=gcd(a,b). Mf 
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Chapter 8 Exercises 


1. 


Prove that {12n:neZ}<{2n:nEeZ}n{38n:ne Z}. 


Proof. Suppose ae {12n:ne Z}. This means a = 12n for some ne Z. Therefore 
a = 2(6n) and a = 3(4n). From a = 2(6n), it follows that a is multiple of 2, so 
ae {2n:neZ}. From a= 3(4n), it follows that a is multiple of 3, so ace {8n: ne Z}. 
Thus by definition of the intersection of two sets, we have a€é {2n:néeZ}n 
{8n:neZ}. Thus {12n:neZc{2n:neZ}n{38n:neE Z}. | 


. IfkeZ, then {neZ:n|k}C{neZ:n|k*}. 


Proof. Suppose k € Z. We now need to show {ne Z:n|k} ¢ {neZ:n|k*}. 

Suppose ae {ne Z:n|k}. Then it follows that a|k, so there is an integer c for 
which k =ac. Then k? =a?c?. Therefore k? = a(ac”), and from this the definition 
of divisibility gives a|k?. But a|k? means that ae {ne Z:n|k?}. We have now 
shown {neéZ:n|k} S {neZ:n|k*}. a 


. If p and q are integers, then {pn:neN}n{qn:neEN}F@. 


Proof. Suppose p and q are integers. Consider the integer pg. Observe that 
pq €{pn:neN} and pq é {qn:neN}, so pgeé {pn:neN}n{qn:neN}. Therefore 
{pn:neN}n{qn:nEeN}F@. | 


. Suppose A,B and C are sets. If BCC, then Ax BC AxC. 


Proof. This is a conditional statement, and we'll prove it with direct proof. 
Suppose B<C. (Now we need to prove Ax BCA xC.) 


Suppose (a,b)¢ AxB. Then by definition of the Cartesian product we have ace A 
and b€B. But since b€ B and BEC, we have beC. Since ace A and dEC, it 
follows that (a,b)¢ Ax C. Now we’ve shown (a,b)¢ A x B implies (a,b)¢ Ax C, so 
AxBCAxC. 


In summary, we’ve shown that if Bc C, then Ax BC AxC. This completes the 
proof. a 


. If A,B and C are sets then AnN(BUC)=(ANB)U(ANC). 


Proof. We use the distributive law PA(QVR) = (PAQ)v(P AR) from page 48. 


AN(BUC) ={x:xeA aA xe BuC} (def. of intersection) 
={x:xE€A A (xEeB v xEC)} (def. of union) 
={x:(xeA a xeB)v(xeA A xeC)} (distributive law) 
={x : (xe ANB) v (xe ANC)} (def. of intersection) 
= (ANB)U(ANC) (def. of union) 


The proof is complete. a 
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11. If A and B are sets in a universal set U, then AUB =ANB. 


Proof. Just observe the following sequence of equalities. 


AUB =U-(AUB) (def. of complement) 
= {x:(xeU)A(x¢ AUB)} (def. of —) 
= {x:(xEU)A~(xE AUB) 
= {x:(xeEU)n ~ ((x€ A) v(x € B))} (def. of U) 
= {x: (x EU) A(~ (XE A)A ~ (x € B))} (DeMorgan) 
= {x: (mE U)A(xE A)A(XEB)} 
={x:(xEU)A(KEU)A(KEA)A(KEB) (xEU)=(x EU) A(x EU) 


= {x:((xEU)A(x¢€ A) AC(xEU)A(x¢B))} (regroup) 
= {x:(xeU)A(x¢€A)}n{x:(xEU)A(x¢B)} (def. of n) 


= (U-A)n(U-B) (def. of —) 
=-AnB (def. of complement) 
The proof is complete. a 


13. If A,B and C are sets, then A—(BUC)=(A-B)n(A-C). 


Proof. Just observe the following sequence of equalities. 


A-(BUC) = {x:(xeE A)A(x¢BUC)} (def. of —) 
={x:(xEA)A~(xEBUC)} 
= {x:(x€ A)A ~ (x EB) Vv (xe C))} (def. of U) 
= {x:(x€ A)A(~ (x€ B)A ~ (x € C))} (DeMorgan) 
= {x:(xE A)A(XEB)A(x €C)} 
= {x:(xE A)A(XEA)A(KEB)A(xEC)} (x€ A)=(x€ A)A(x€ A) 


= {x: (xe A)A(x¢€B)AC(xE A)A(xEC))} (regroup) 
= {x:(xE A)A(x EB) N{x:(xE A)A(xEC)} (def. of n) 
= (A-B)n(A-C) (def. of —) 
The proof is complete. a 


15. If A,B and C are sets, then (ANB)—C =(A-—C)n(B-C). 


Proof. Just observe the following sequence of equalities. 


(ANB)-C = {x: («Ee ANB)A(x€C)} (def. of —) 
= {x:(xE A)A(KEB)A(x€C)} (def. of n) 
= {x:(xE A)A(XEC)A(XEB)A(x€¢ C)} (regroup) 


= {x: (xe A)A(xEC)AC(xEB)A(x€C))} (regroup) 
= fx: (we A)A(XEC)}N{x:(xE B)A(x¢€C)} (def. of n) 
= (A-C)n(B-C) (def. of n) 
The proof is complete. a 


17. If A,B and C are sets, then A x (BNC)=(Ax B)n(AxC). 


Proof. See Example 8.12. iz 
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19. 


21. 


23. 


25. 


Prove that {9” :ne€ Z}<{3" :ne Z}, but 9" :ne Z} 4 {38":ne Zh. 


Proof. Suppose ac {9":neé Z}. This means a =9” for some integer ne Z. Thus 
a = 9” = (37) = 32”. This shows a is an integer power of 3, so a€ {3":ne€Z}. 
Therefore a {9”: ne Z} implies ac {3" :n€ Z}, so {9" :n€ Z}C{3" :ne Z}. 


But notice {9”:ne Z}#4 {3":ne Z} as 3€ {3": ne Z}, but 3¢ {9" :neE Z}. a 
Suppose A and B are sets. Prove ACB if and only if A-B=@. 


Proof. First we will prove that if ACB, then A—B=9. Contrapositive proof is 
used. Suppose that A-B#@. Thus there is an element ac A—B, which means 
acéA but aé¢B. Since not every element of A is in B, we have AZB. 


Conversely, we will prove that if A-B=@, then ACB. Again, contrapositive 
proof is used. Suppose A¢B. This means that it is not the case that every 
element of A is an element of B, so there is an element ac A with a¢ B. 
Therefore we have ae A-B, so A-BF@. | 


For each a €R, let Ag = {(x,a(x” — 1))€ R?: xe R}. Prove that (] Aq = {(-1,0),(1,0))}. 


acR 


Proof. First we will show that {(-1,0),(1,0))} ¢ (] Aa. Notice that for any ae R, 
aeR 
we have (—1,0)€ A, because A, contains the ordered pair (—1,a((—1)?—1) = (1,0). 
Similarly (1,0)¢ Ag. Thus each element of {(—1,0),(1,0))} belongs to every set 
Aa, so every element of (| Aa, so {(-1,0),(1,0))} © (] Aa. 
aeR aeR 
Now we will show [{) Aa €{(-1,0),(1,0))}. Suppose (c,d)€ (] Aa. This means 
acR aeR 

(c,d) is in every set Ag. In particular (c,d) € Ao = {(xc, O(x? -1)):xe R} = {(x,0): x € R}. 
It follows that d=0. Then also we have (c,d) =(c,0)€ Ay = {(x, A(x? -1)):xe€ R}= 
{(x,x2-1):xeR}. Therefore (c,0) has the form (c,c?—1), that is (c,0) =(c,c?— 1). 
From this we get c?-1=0, so c=+1. Therefore (c,d) = (1,0) or (c,d) = (-1,0), 
so (c,d) € {(—1,0),(1,0))}. This completes the demonstration that (c,d) € (| Aa 


acR 
implies (c,d) € {(-1,0),(1,0))}, so it follows that () Aa € {(-1,0),(1,0))}. 
aceR 
Now it’s been shown that {(-—1,0),(1,0))}}¢ (] Aa and (] Aa € {((-1,0),(1,0))}, so it 
aceR acR 
follows that () Aa = {(-1,0),(1,0))}. | 


aceR 


Suppose A,B,C and D are sets. Prove that (A x B)U(C x D) <(AUC)x (BUD). 


Proof. Suppose (a,b)€(A x B)U(C x D). 

By definition of union, this means (a,b)« (A x B) or (a,b) €(C x D). 

We examine these two cases individually. 

Case 1. Suppose (a,b)€(AxB). By definition of x, it follows that ae A and 
beB. From this, it follows from the definition of U that ae AUC and b€ BUD. 
Again from the definition of x, we get (a,b)« (AUC) x (BUD). 


27. 


29. 


31. 
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Case 2. Suppose (a,b)€(C xD). By definition of x, it follows that ae C and 
beD. From this, it follows from the definition of U that ae AUC and b€ BUD. 
Again from the definition of x, we get (a,b) «(A UC)x (BUD). 

In either case, we obtained (a,b)<€ (AUC) x (BUD), 

so we've proved that (a,b) «(A x B)U(C x D) implies (a,b) € (AUC) x (BUD). 
Therefore (A x B)U(C x D)< (AUC) x (BUD). | 


Prove {12a+4b:a,b€Z}={4ce:ce Z}. 


Proof. First we show {12a +4b:a,b€ Z}¢ {4c:ce Z}. Suppose x € {12a + 4b:a,b€ Z}. 
Then x= 12a+4b for some integers a and b. From this we get x = 4(3a+), so 
x= 4c where c is the integer 3a+b. Consequently x € {4c:ce€ Z}. This establishes 
that {12a+4b:a,beZ}¢ {4ce:ce Z}. 

Next we show {4c:ce Z}S{12a+4b:a,be Z}. Suppose xe {4c:ce Z}. Then x= 4c 
for some ce Z. Thus x = (12+ 4(—2))c = 12c + 4(-2c), and since c and —2c are 
integers we have x€ {12a+4b:a,be Z}. 

This proves that {12a+4b:a,be€ Z}={4c:ceé Z}. a 


Suppose A#@. Prove that Ax BC AxC, if and only if BCC. 


Proof. First we will prove that if AxBCAxC, then BCC. Using contrapositive, 
suppose that B¢C. This means there is an element ce C with c¢ B. Since 
A#Q, there exists an element ac A. Now consider the ordered pair (a,c). Note 
that (a,c)e Ax C, but (a,c) AxB. This means Ax BZAxC. 


Conversely, we will now show that if BCC, then Ax BC AxC. We use direct 
proof. Suppose BCC. Assume that (a,b)¢ Ax B. This means ace A and be B. 
But, as BCC, we also have b€ C. From aeA and bEC, we get (a,b)Ee AxC. 
We’ve now shown (a,b)¢ A xB implies (a,b)e AxC, so Ax BGAXxC. a 


Suppose B4% and Ax BC BxC. Prove ACC. 


Proof. Suppose B 4 ¢ and Ax BCBxC. In what follows, we show that A ¢C. 
Let xe A. Because B is not empty, it contains some element b. Observe that 
(x,b)e AxB. But as Ax BC BxC, we also have (x,b)e€ Bx C, so, in particular, 
xe€B. AsxeA and xe B, we have (x,x)©€ AxB. But as Ax BCBxC, it follows 
that (x,x)¢BxC. This implies xeC. 


Now we’ve shown xe A implies xe C, so ACC. a 


Chapter 9 Exercises 


1. 


If x,yeR, then |x+ y| =|x|+ yl. 

This is false. 

Disproof: Here is a counterexample: Let x=1 and y=~—1. Then |x+¥|=0 and 
|x| +|y|=2, so it’s not true that |x + y| =|x|+|yI. 
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11. 


13. 


15. 


17. 


19. 


. IfneZ and n®*—n is even, then n is even. 


This is false. 
Disproof: Here is a counterexample: Let n =3. Then n> —n=3°-—3= 240, but 
n is not even. 


. If A, B,C and D are sets, then (A x B) U(C x D)=(AUC) x (BUD). 


This is false. 

Disproof: Here is a counterexample: Let A = {1,2}, B = {1,2}, C = {2,3} and 
D = {2,3}. Then (A x B)U(C x D) = {(1, 1), (1, 2), (2, 1), (2, 2)} U {(2, 2), (2, 3), (3, 2), (3, 3)} = 
{(1, 1), (1, 2), (2, 1), (2, 2), (2,3), (3, 2),(8,3)}. Also (AUC) x (BUD) = {1,2,3} x {1,2,3}= 
{(1, 1),(1, 2), (1, 3), (2, 1), (2, 2), (2, 3), (3, 1), (3, 2),(3,3)}, so you can see that (A x B)u 
(C x D)#(AUC)x (BUD). 


. If A, B and C are sets, and Ax C=BxC, then A=B. 


This is false. 
Disproof: Here is a counterexample: Let A = {1}, B = {2} and C=¢@. Then 
AxC=BxC=@, but AFB. 


. If A and B are sets, then A(A)- A(B)S A(A-B). 


This is false. 

Disproof: Here is a counterexample: Let A = {1,2} and B= {1}. Then A(A)- 
P(B) = {D, {1}, {2}, {1, 2}} — {D, {1}} = {{2}, {1,2}. Also AA-B)= PA({2}) = {@, {2}}. In 
this example we have “(A)- A(B)Z A(A—B). 

If a,beN, then a+b<ab. 

This is false. 

Disproof: Here is a counterexample: Let a=1 and b=1. Then a+b=2 and 
ab =1, so it’s not true that a+b<ab. 


There exists a set X for which RC X and @c€X. This is true. 


Proof. Simply let X =Ru{@}. If xe R, then xe RU{G}=X, so RC X. Likewise, 
DERU{G}=X because Ge {G}. ial 


Every odd integer is the sum of three odd integers. This is true. 


Proof. Suppose n is odd. Then n=n+1+(-1), and therefore n is the sum of 
three odd integers. a 


For all sets A and B, if A-B=@, then B#@. 

This is false. 

Disproof: Here is a counterexample: Just let A=¢@ and B=@. Then A-B=@Q, 
but it’s not true that BZ @. 

For every r,s € Q with r <s, there is an irrational number u for which r<u<s. 
This is true. 


Proof. (Direct) Suppose r,s € Q with r<s. Consider the number u =r+ V285. 
In what follows we will show that u is irrational and r<u<s. Certainly since 


21. 


23. 
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s—r is positive, it follows that r<r+V/255" =u. Also, since V2 <2 we have 
2 


s-r s-r 
=r+v2 <r+2 = 
u=r+V2 ; r 5 s, 


and therefore u<s. Thus we can conclude r<u<s. 


Now we just need to show that u is irrational. Suppose for the sake of contra- 
diction that wu is rational. Then u = ¢ for some integers a and b. Since r and s 
are rational, we have r= § and s= ¢ for some c,d,e,f « Z. Now we have 


s-r 
~ j=— 
u r+ 5 
a Cc wind 
= 2 
b gy 
ad-—be ed—cf 
= 2 
bd v2 2df 
(ad — bc)2df 
Mace ge Mine 9 
bd(ed —-cf) Me 


This expresses V2 as a quotient of two integers, so V2 is rational, a contradiction. 
Thus uw is irrational. 


In summary, we have produced an irrational number u with r<u<s, so the 
proof is complete. a 


There exist two prime numbers p and gq for which p — g = 97. 

This statement is false. 

Disproof: Suppose for the sake of contradiction that this is true. Let p and 
qg be prime numbers for which p — q = 97. Now, since their difference is odd, p 
and g must have opposite parity, so one of p and q is even and the other is 
odd. But there exists only one even prime number (namely 2), so either p =2 
or g=2. If p=2, then p-—q =97 implies g = 2—97 = -95, which is not prime. 
On the other hand if g = 2, then p—q =97 implies p = 99, but that’s not prime 
either. Thus one of p or q is not prime, a contradiction. 


If x,ye€R and x® < y®, then x<y. This is true. 


Proof. (Contrapositive) Suppose x= y. We need to show x?= y®. 

Case 1. Suppose x and y have opposite signs, that is one of x and y is positive 
and the other is negative. Then since x= y, x is positive and y is negative. 
Then, since the powers are odd, x° is positive and y® is negative, so x? = y°. 
Case 2. Suppose x and y do not have opposite signs. Then x7+xy+y?>0 and 
also x— y>0 because x >y. Thus we have x? — y? =(«— y)(«?+xy+y?)=0. From 
this we get x°— y?=0, so x°> y?. 

In either case we have x? = y®. | 
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25. 


27. 


29. 


31. 


33. 


For all a,b,cé Z, if a| bc, then a|b ora|c. 
This is false. 
Disproof: Let a=6, b=3 and c=4. Note that a|bc, but a{b and afc. 


The equation x? = 2* has three real solutions. 


Proof. By inspection, the numbers x = 2 and x = 4 are two solutions of this 
equation. But there is a third solution. Let m be the real number for which 
m2 = 5: Then negative number x = —2m is a solution, as follows. 


2 
m2m 2 1 1 
1? =(-2m)? = 4m? = 4("——) -4( 4) = = 972m _ 9% 
gm gm 


Therefore we have three solutions 2, 4 and m. | 


If x,yeER and |x+y|=|x—yl|, then y=0. 
This is false. 
Disproof: Let x=0 and y=1. Then |x+¥y/=|x—y|, but y=1. 


No number appears in Pascal’s triangle more than four times. 
Disproof: The number 120 appears six times. Check that (1) = (49) = (48) = 


(12) = (42°) = (ig) = 120. 
Suppose f(x) = ag +a1x+a9x"+-+-+a,x”" is a polynomial of degree 1 or greater, 


and for which each coefficient a; is in N. Then there is an né€N for which the 
integer f(n) is not prime. 


2 


Proof. (Outline) Note that, because the coefficients are all positive and the 
degree is greater than 1, we have f(1)>1. Let b= f(1)>1. Now, the polynomial 
f(x)-b has a root 1, so f(x)—b = (x—1)g(x) for some polynomial g. Then 
f(x) = (x-1)g(x)+ 6b. Now note that f(6+1) = bg(b)+b = b(g(b) +1). If we can 
now show that g(b)+1 is an integer, then we have a nontrivial factoring 
f(6 +1) = b(g(b)+1), and f(b+1) is not prime. To complete the proof, use the 
fact that f(x)—b =(x—1)g(x) has integer coefficients, and deduce that g(x) must 
also have integer coefficients. a 


Chapter 10 Exercises 


1. 


For every integer neN, it follows that 1+2+3+4+---+n= 


Proof. We will prove this with mathematical induction. 


1741 
(1) Observe that if n =1, this statement is 1= 


, which is obviously true. 
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(2) Consider any integer k => 1. We must show that Sk implies Sz+1. In other 
words, we must show that if 1+2+3+4+---+k= "= is true, then 


(k+1)7+(k+1) 


14+2+34+4+:---+k+(R+1))= 5 


is also true. We use direct proof. : 
Suppose k= 1 and 1+2+3+4+---+k= "3". Observe that 


ll 


14+24+34+4+---+k+(k+1) 
(14+24+34+4+---+k)+(R+1) 
2 2 

k BG arp k*+k+2(k +1) 

2 2 
k274+2k+1+k+1 
2 
(k+1)7+(k+1) 

5 ‘ 


Therefore we have shown that 1+2+34+44+---+k+(k+1)= ee. a 


3. For every integer néN, it follows that 1° +23+33+43+-.-+n3= cea 

Proof. We will prove this with mathematical induction. 

(1) When n=1 the statement is 1° = ey = 4 = 1, which is true. 

(2) Now assume the statement is true for some integer n =k = 1, that is assume 
1242? +32 4.48 4-4 RFS ust Sa Observe that this implies the statement is 
true for n=k+1. 


134.93 4.33443 4...4434+(k+1) 


(13423 4.33443 +..-+23)4+(k+1)? = 
k(k +1)" 


R(k+1)? | A(k+ 0° 
4 4 
h?2(k +1)? +4(k+1)° 
4 
(k + 1)7(k2 + 4(k + 1)!) 
4 
(k + 1)7(k2 + 4k +4) 
4 

(k+1)2(k +2)? 
4 
(k+1)2((k +1)+1)? 
4 


ll 


+(k+1)? 


Therefore 1° +2? +3°+43+..-+43+(k+1)8 = Gard 1y which means the 
statement is true for n=k+1. | 
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5. IfneN, then 91492493 4...49n -gntl_9o 


Proof. The proof is by mathematical induction. 

(1) When n =1, this statement is 2! = 2!*!-2, or 2=4-2, which is true. 

(2) Now assume the statement is true for some integer n = k = 1, that is assume 
21492493 4...42% —9k+1_92. Observe this implies that the statement is true 
for n=k+1, as follows: 


OL OF P09 folie OM 4 Oe et 
(Pr OPO? Oh) OMe 
gktl_94 okt = 9.9kt1_9 
gkt2 _o 
gktD+1_ 9 


Thus we have 2! +2?+4+2°+...42% 4+ 9+1 =9%+D+1_9 so the statement is true 


for n=k+1. 
Thus the result follows by mathematical induction. a 
1)2 7 
7. IfneN, then 1-34+2-443-5+4-6+--+n(n42)= meee 


Proof. The proof is by mathematical induction. 

(1) When n=1, we have 1-3= 1OsRErD which is the true statement 3 = 8. 

(2) Now assume the statement is true for some integer n = k = 1, that is assume 
1-34+2-44+3-54+4-6+---+R(R+2)= BED ORYD | Now observe that 


1-342-44+3-5+4-64+---+k(R+2)+(R+1)(R4+1)4+2) = 
(1-34+2-44+3-54+4-64+---+Rk(R+2))+(R+1(R4+1)+2) = 


me ED ee MR +D+2) 


R(kR+1)2k+7) a 6(k + 1)(k +3) 


6 6 
R(R+1)(2k + 7)+ 6(R + 1)(k +3) 


6 
(k+1)(R(2k + 7) + 6(R +3)) 
6 

(k + 1)(2k2 + 13k +18) 


6 
(k+1)(k + 2)(2k +9) 


6 
(kR+1(Rk +1)4+1)(2(R4+1)+7) 


6 


ll 


Thus we have 1-3+2-4+3-5+4-64+---+k(R+2)+(R+1)(R+1)+2) = PGs Ds DOE sD 
and this means the statement is true for n=k+1. 
Thus the result follows by mathematical induction. a 


9. 


11. 


13. 
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For any integer n =0, it follows that 24|(52” — 1). 


Proof. The proof is by mathematical induction. 
(1) For n=0, the statement is 24| (529-1). This is 24|0, which is true. 


(2) Now assume the statement is true for some integer n =k => 1, that is assume 
24|(52* 1). This means 5?” — 1 = 24a for some integer a, and from this we 
get 52* = 24a +1. Now observe that 


5757-1 = 


5°(24a+1)-1 = 
25(24a+1)-1 = 
25-24a+25-1 = 24(25a+1). 


This shows 52+) — 1 = 24(25a +1), which means 24| 524+) — 1. 


This completes the proof by mathematical induction. a 


For any integer n= 0, it follows that 3|(n?+5n +6). 


Proof. The proof is by mathematical induction. 
(1) When n =0, the statement is 3|(0?+5-0+6), or 3/6, which is true. 


(2) Now assume the statement is true for some integer n =k = 0, that is assume 
3|(k2+5k+6). This means k?+5k+6= 3a for some integer a. We need to 
show that 3|((k + 1)? +5(k+1)+6). Observe that 


(R+1)?+5(R+1)4+6 = h?4+3k74+3k4+1+5k+54+6 
= (k2+5k+6)+3k7+3k+6 
= 8a+3k7+3k+6 
= 3a+k?+k+2). 


Thus we have deduced (k + 1)? -(k +1) =3(a+k7+k+2). Since a+k?+k+2 is 
an integer, it follows that 3|((k +1)? +5(k+1)+6). 


It follows by mathematical induction that 3|(n?+5n+6) for every n= 0. a 


For any integer n = 0, it follows that 6| (n° —n). 


Proof. The proof is by mathematical induction. 
(1) When n=0, the statement is 6|(0?-0), or 6|0, which is true. 
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(2) Now assume the statement is true for some integer n = k = 0, that is, assume 
6|(k®—k). This means k® —k = 6a for some integer a. We need to show that 
6|((k+1)?-(k+1)). Observe that 


(k+1)?-(k+1) = k?4+3k24+3k+1-k-1 
= (k®—k)+3k7+3k 
= 6a+3k2+3k 
= 6a+38k(k+1). 


Thus we have deduced (k + 1)?—(k+1)=6a+3k(k+1). Since one of k or (k +1) 
must be even, it follows that k(k+1) is even, so k(k+1)=2b for some integer 
b. Consequently (k + 1)?-(k +1) = 6a + 3k(k +1) = 6a+3(2b) = 6(a+b). Since 
(k +1)? -(k+1)=6(a+b) it follows that 6|((k+1)?-(k+1)). 


Thus the result follows by mathematical induction. a 


Hsing Gg ae A is © oh 1 
15. If neN, then (ot og t ga hag Pt Seep LO Se 


Proof. The proof is by mathematical induction. 


(1) When n= 1, the statement is Top =1- cs which simplifies to 5 = 5. 


(2) Now assume the statement is true for some integer n = k = 1, that is assume 
jotogtaatagtect RED =1-,4,. Next we show that the statement for 
n=k+1is true. Observe that 


1 1 1 1 1 1 
+ + + tee + = 
1:2 9-3 3-4 4:5 k(k+1) (k+1)(k+1)4+1) 
+ s + A + t + + 4 J+ : = 
1-2 2-3 3-4 4-5 k(k+1)) (R+1)\(R+2) | 


| 1 | 1 
1 + = 
k+1 (k+1)(R +2) 

1 1 
k+l (k+1k+2) 
k+2 1 
(k+1(k+2) (R+D(R+2) 
k+1 
(k+1)\(R +2) 
1 


Panes eee 
k+2 


jae 
(k+1)+1° 


: ‘ 1 1 1 1 1 = 1 ‘ ‘ 
This establishes Tatog3t3at ott Gaps Ds1 = 1- spr which is to Say 


that the statement is true for n=k+1. 


This completes the proof by mathematical induction. a 
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17. Suppose A;,Ao,...A, are sets in some universal set U, and n= 2. Prove that 


19. 


21. 


Ain Agn-:-NA, = AyUAQU::-UAp. 


Proof. The proof is by strong induction. ie 
(1) When n = 2 the statement is AynN Ag = A,UAg. This is not an entirely 
obvious statement, so we have to prove it. Observe that 


Aj nAg 


{x:(xeEU)a(x¢ A, NAg)} (definition of complement) 

= {x:(xEeU)A~(xE€AyNAg)} 

= {x:(xeU)A~((x€ A1)A(x€ Agd))} (definition of n) 

= {x:(xE€U)A(~ (xe Ay)v ~(x€ Ag))} (DeMorgan) 

= {x:(xeU)NA((x€¢ Aq) V(x €¢ Ag))} 

= {x:(xeEU)A(x¢ Az) V(x EU) A(x ¢ Ag)} (distributive prop.) 
= {x:(wEU)A(x€¢ Az) Ulx: (xe U) A(x ¢ Ag))} (def. of U) 

= A,UA, (definition of complement) 


(2) Let k=>2. Assume the statement is true if it involves k or fewer sets. Then 


A,NAgNn:-::NAp_1NARNARS1 = 
AinAgn--NAp_1~n(ApNnApey) = AyUAgu---UAg_-1UARN ARs 


= A,UA9QU---UAp_1UA,UApG1 


Thus the statement is true when it involves k +1 sets. 
This completes the proof by strong induction. a 


Prove ©”_, 1/k? <2-1/n for every n. 
h=1 


Proof. This clearly holds for n = 1. Assume it holds for some n=1. Then 
Lett Vk? <2-Int 1(n+1)? =2- (n+1)"—n <9_1/(n +1). The proof is complete. M 


n(n+1)7 


If neN, then 4+44+44..-4 2514 
, [> 2" 3 2 


NIs 


Proof. If n=1, the result is obvious. 
Assume the proposition holds for some n > 1. Then 


1 (; 1 1 =] 1 1 1 5] 
= +o fa 4feeet + ee 


1 
+= fee + + + —— 
3 gn+1 1 2 8 2n 27+1 2742 2°43 gn+1 


1 
—-+- 
1 2 


n 1 1 1 1 
> (1+ }+( + + +--+ —_], 
2 27+1 2742 2°43 gust 


Now, the sum (sty + 5 + ag free sir] on the right has 2”*!-2” = 2” terms, 


all greater than or equal to oa , so the sum is greater than 2” sit = 3: Therefore 


i. 5 es ol 1 n 1 1 1 1 a 
we get 4+ 3+ 3 +--+ ahr > (1+ 3)+(gty + gg + hg t+ gd} = (149) 49 = 


1+ ned This means the result is true for n +1, so the theorem is proved. 
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23. 


25. 


27. 


Use induction to prove the binomial theorem (x + y)” =D". (‘)x”"‘y’. 


Proof. Notice that when n = 1, the formula is (x+y)! = (j)xty® + (j)x°y! =x+y, 
which is true. 


Now assume the theorem is true for some n> 1. We will show that this implies 
that it is true for the power n+1. Just observe that 


yt = (xt yaxt yy” 


- een ry 
OW 


Hees p (res 
i=o\! 


l 


(2 


- ey (et ae 


(x+y 


int D-i i nt+1 


yotry 


1 n+1 


This shows that the formula is true for (x+y)"*!, so the theorem is proved. @ 
Concerning the Fibonacci sequence, prove that F,+Fo+F3+F4+...+F,=Fps9-1. 


Proof. The proof is by induction. 

(1) When n=1 the statement is Fy = F14,-1=F3-1=2-1=1, which is true. 
Also when n = 2 the statement is F, + Fo = Fog -1=F4-1=3-1=2, which is 
true, as Fj + Fo=14+1=2. 

(2) Now assume k21 and Fi +Fo+F3+F4+...+F, =Fpig—1. We need to show 
Fi +Fot+F3+F4a+...+Fe+ Feo =Fe+3—-1. Observe that 


Fi +Fo+F3+Fat+...+F,+Fpiy = 

(FP, +Fot+F3+Fat+...+Fe)+ Frey 
Feyg-1++Fei1 = (Feiit+Feig)-1 

Fpi3-1. 


This completes the proof by induction. a 
Concerning the Fibonacci sequence, prove that F, + F3+--:+Fon-1=Foan. 


Proof. If n = 1, the result is immediate. Assume for some n> 1 we have 
De Foi-1 = Fan. Then O74} Foi-1 = Fons1t Ly Foi-1 = Fonsit Fon = Fons2 = Fans 
as desired. | 


29. 


31. 


33. 


275 


Prove that (5) + ea + (ee) + =) feed (4) + (-) 2 ree 


Proof. (Strong Induction) For n = 1 this is (3) + 7) =1+0=1=Fo=Fy,4,. Thus 
the assertion is true when n= 1. 


)+ (?) = Fy41 whenever 


Now fix n and assume that (‘)+ (ere eae tet (, 
+(,'). We also often use 


k<n. In what follows we use the identity (/) = {7} 
(;) = 0 whenever it is untrue that 0<b <a. 


Cee heen 


= Fyt+Fyp-1=F n 
This completes the proof. a 


Prove that >”_, (") = (ti), where reN. 


Hint: Use induction on the integer n. After doing the basis step, break up the 
k-1 


expression (*) as (*) =(" + (a). Then regroup, use the induction hypothesis, 


and recombine using the above identity. 

Suppose that n infinitely long straight lines lie on the plane in such a way that 
no two are parallel, and no three intersect at a single point. Show that this 
arrangement divides the plane into ea regions. 

Proof. The proof is by induction. For the basis step, suppose n = 1. Then there 
is one line, and it clearly divides the plane into 2 regions, one on either side of 
the line. As 2= e142 = —- the formula is correct when n =1. 

Now suppose there are n+1 lines on the plane, and that the formula is correct 
for when there are n lines on the plane. Single out one of the n+1 lines on the 
plane, and call it 2. Remove line /, so that there are now n lines on the plane. 


By the induction hypothesis, these n lines ‘4 
divide the plane into ““*"*2 regions. Now add 5 
line ¢ back. Doing this adds an additional A 
n+1 regions. (The diagram illustrates the 

case where n+1=5. Without /, there are 5 


n=4 lines. Adding ¢ back produces n+1=5 
new regions.) ‘1 
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Thus, with n+1 lines there are all together (n+ 1)+ nent? regions. Observe 
n2+nt+2 An+2+n24+n+2 (n+1)?2+(n+1)+2 
a ee rr 


2 
Thus, with n+1 lines, we have (net) ine D+? regions, which means that the 


formula is true for when there are n+1 lines. We have shown that if the 
formula is true for n lines, it is also true for n+1 lines. This completes the 
proof by induction. a 


35. If n,keN, and n is even and k is odd, then &) is even. 
Proof. Notice that if k is not a value between 0 and n, then (i) = 0 is even; thus 
from here on we can assume that 0<k<n. We will use strong induction. 


For the basis case, notice that the assertion is true for the even values n= 2 


and n=4: (*) = 2; ({) =4; (3) =4 (even in each case). 
Now fix and even n assume that (7) is even whenever m is even, k is odd, and 


m<n. Using the identity (‘) = (77})+(";') three times, we get 


fe) = ae 


| | 
—_ 
xf x Sf 
I ot 
So wb So bw 
ee 
+ + 
Oo as 
—_—_ ws 
x 3 
a ~) 
Hw ——— 
rs aE 
F — 
——_- ws 
a) 
~ 1 et pw 
wo" 
— 
— 
3 
= 
bo 
— 


Now, n-2 is even, and k and k-2 are odd. By the inductive hypothesis, the 
outer terms of the above expression are even, and the middle is clearly even; 
thus we have expressed (7) as the sum of three even integers, so it is even. 


Chapter 11 Exercises 
Section 11.0 Exercises 
1. Let A = {0,1,2,3,4,5}. Write out the relation R that expresses > on A. Then 


illustrate it with a diagram. 
2 1 


R= (5,4), (5,3), (5,3), (5,3), (5, D, (5, 0), (4,3), (4, 2), (4, D, 
(4,0), (3, 2), (3, 1), (3, 0), (2, 1), (2,0), (1, 0)} 3 0 


4 5 


3. Let A = {0,1,2,3,4,5}. Write out the relation R that expresses = on A. Then 
illustrate it with a diagram. 
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R = 4(5,5),(5,4), (5,3), (5,2), (5, 1), (5,0), 
(4,4), (4,3), (4, 2), (4, 1), (4,0), 
(3,3), (3, 2), (3, 1), (8, 0), 
(2,2), (2, 1), (2,0), (1, 1), (1, 0), (0,0)} 


5. The following diagram represents a relation R on a set A. Write the sets A 
and R. Answer: A = {0,1,2,3,4,5}; R = {(3,3),(4,3), (4,2), (1,2), (2,5), (5, 0)} 

7. Write the relation < on the set A= Z as a subset R of Zx Z. This is an infinite 
set, so you will have to use set-builder notation. 
Answer: R = {(x,y)e Zx Z:y—-xeEN} 


9. How many different relations are there on the set A = {1,2,3,4,5,6}? 
Consider forming a relation RC Ax A on A. For each ordered pair (x,y)e Ax A, 
we have two choices: we can either include (x,y) in R or not include it. There 
are 6-6 = 36 ordered pairs in Ax A. By the multiplication principle, there are 
thus 2°° different subsets R and hence also this many relations on A. 


11. Answer: 24!” 13. Answer: # 15. Answer: = (mod 3) 


Section 11.1 Exercises 


1. Consider the relation R = {(a,a),(b,b),(c,c),(d,d),(a, b),(b,a)} on the set A = {a,b,c,d}. 
Which of the properties reflexive, symmetric and transitive does R possess and 
why? If a property does not hold, say why. 

This is reflexive because (x,x)¢R (i.e., xRx )for every xe A. 
It is symmetric because it is impossible to find an (x, y)¢R for which (y,x)¢R. 
It is transitive because (xRy A yRz) => xRz always holds. 


3. Consider the relation R = {(a,b),(a,c),(c, b),(b,c)} on the set A = {a,b,c}. Which 
of the properties reflexive, symmetric and transitive does R possess and why? 
If a property does not hold, say why. 
This is not reflexive because (a,a)¢R (for example). 
It is not symmetric because (a,b)€R but (b,a)¢R. 
It is not transitive because cRb and bRc are true, but cRc is false. 


5. Consider the relation R = {(0,0), (V2, 0), (0, V2),(V2, V2)} on R. Say whether this 
relation is reflexive, symmetric and transitive. If a property does not hold, say 
why. 
This is not reflexive because (1,1)¢R (for example). 
It is symmetric because it is impossible to find an (x, y)¢ R for which (y,x)¢R. 
It is transitive because (xRy A yRz)=>xRz always holds. 

7. There are 16 possible different relations R on the set A = {a,b}. Describe all of 


them. (A picture for each one will suffice, but don’t forget to label the nodes.) 
Which ones are reflexive? Symmetric? Transitive? 
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11. 


13. 


ga ay & + ae) Cm 
ae—sh @—b ae) 
Aen _*b (Gb de _6) Ca 
ab (Gb a <>) 


Only the four in the right column are reflexive. Only the eight in the first and 
fourth rows are symmetric. All of them are transitive except the first three 
on the fourth row. 


. Define a relation on Z by declaring xRy if and only if x and y have the same 


parity. Say whether this relation is reflexive, symmetric and transitive. If a 
property does not hold, say why. What familiar relation is this? 

This is reflexive because xRx since x always has the same parity as x. 

It is symmetric because if x and y have the same parity, then y and x must 
have the same parity (that is, xRy > yRx). 

It is transitive because if x and y have the same parity and y and z have the 
same parity, then x and z must have the same parity. (That is («RyAyRz) > xRz 
always holds.) 

The relation is congruence modulo 2. 

Suppose A = {a,b,c,d} and R = {(a,a),(b, b),(c,c),(d,d)}. Say whether this relation 
is reflexive, symmetric and transitive. If a property does not hold, say why. 
This is reflexive because (x,x)¢R for every xe A. 

It is symmetric because it is impossible to find an (x, y)¢ R for which (y,x)¢R. 
It is transitive because (xRy A yRz)=>xRz always holds. 

(For example (aRa AaRa)=> aRa is true, etc.) 


Consider the relation R = {(x,y)e Rx R:x—yeZ} on R. Prove that this relation 
is reflexive and symmetric, and transitive. 


Proof. In this relation, xRy means x— ye Z. 


To see that R is reflexive, take any x€ R and observe that x-x=0€ Z, so xRx. 
Therefore R is reflexive. 


To see that R is symmetric, we need to prove xRy => yRx for all x,yeR. We 
use direct proof. Suppose xRy. This means x—yeZ. Then it follows that 
—(x-y)=y-x is also in Z. But y-xe Z means yRx. We’ve shown xRy implies 
yRx, so R is symmetric. 


To see that R is transitive, we need to prove (xRy A yRz) > xRz is always 
true. We prove this conditional statement with direct proof. Suppose xRy and 
yRz. Since xRy, we know x—ye Z. Since yRz, we know y-ze Z. Thus x-y 
and y—z are both integers; by adding these integers we get another integer 
(x—y)+(y-z)=x-z. Thus x—ze Z, and this means xRz. We’ve now shown that 
if xRy and yRz, then xRz. Therefore R is transitive. a 
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15. Prove or disprove: If a relation is symmetric and transitive, then it is also 
reflexive. 
This is false. For a counterexample, consider the relation R = {(a,a),(a,b),(b,a),(b, b)} 
on the set A = {a,b,c}. This is symmetric and transitive but it is not reflexive. 


17. Define a relation ~ on Z as x~ y if and only if |x—y|<1. Say whether ~ is 
reflexive, symmetric and transitive. 
This is reflexive because |x—x|=0<1 for all integers x. It is symmetric because 
x~y if and only if |x—y| <1, if and only if |y—x| <1, if and only if y ~x. It is not 
transitive because, for example, 0~ 1 and 1 ~ 2, but is not the case that 0 ~ 2. 


Section 11.2 Exercises 


1. Let A = {1,2,3,4,5,6}, and consider the following equivalence relation on A: R= 
{(1, 1), (2,2), (3,3), (4, 4), (5, 5), (6, 6), (2, 3), (3, 2), (4,5), (5, 4), (4, 6), (6, 4), (5, 6),(6,5)}. List 
the equivalence classes of R. 


The equivalence classes are: [1]={1}; [2]=[8]={2,3}; [4]=[5]=[6]={4,5,6}. 
3. Let A = {a,b,c,d,e}. Suppose R is an equivalence relation on A. Suppose R has 
three equivalence classes. Also aRd and bRc. Write out R as a set. 
Answer: R = {(a,a),(b,6),(c,¢),(d,d),(e,e),(a,d),(d,a),(b,c),(c, b)}. 
5. There are two different equivalence relations on the set A = {a,b}. Describe 
them all. Diagrams will suffice. 
Answer: R = {(a,a),(b,b)} and R = {(a,a),(b,b),(a, b),(b,a)} 
7. Define a relation R on Z as xRy if and only if 3x—5y is even. Prove R is an 
equivalence relation. Describe its equivalence classes. 


To prove that R is an equivalence relation, we must show it’s reflexive, sym- 
metric and transitive. 

The relation R is reflexive for the following reason. If x¢ Z, then 3x—5x = —2x 
is even. But then since 3x—5x is even, we have xRx. Thus R is reflexive. 


To see that R is symmetric, suppose xRy. We must show yRx. Since xRy, we 
know 3x—5y is even, so 3x —5y = 2a for some integer a. Now reason as follows: 


38x-5y = 2a 
8x-5y+8y-8x = 2a+8y-8x 
38y-5x = 2a+4y-A4x). 


From this it follows that 3y—5x is even, so yRx. We’ve now shown xRy implies 
yRx, so R is symmetric. 


To prove that R is transitive, assume that xRy and yRz. (We will show that this 
implies xRz.) Since xRy and yRz, it follows that 3x—5y and 3y—5z are both even, 
so 8x—5y = 2a and 3y—5z = 2b for some integers a and b. Adding these equations, 
we get (8x -—5y)+ (3y—5z) = 2a+2b, and this simplifies to 3x—-5z =2(a+b+y). 
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Therefore 3x—5z is even, so xRz. We’ve now shown that if xRy and yRz, then 
xRz, so R is transitive. 


We’ve now shown that R is reflexive, symmetric and transitive, so it is an 
equivalence relation. 


The completes the first part of the problem. Now we move on the second part. 
To find the equivalence classes, first note that 


[0] = {xe Z:xRO}= {xe Z:3x—5-0 is even} = {x « Z:3x is even} = {xe Z:x is even}. 
Thus the equivalence class [0] consists of all even integers. Next, note that 
[1] ={x¢€ Z:xR1} = {x€Z:3x-5-1 is even} = {xe Z:3x—5 is even} = {xe Z:x is odd}. 


Thus the equivalence class [1] consists of all odd integers. 


Consequently there are just two equivalence classes {...,—4,—2,0,2,4,...} and 
{...,-3,-1,1,3,5,...}. 


9. Define a relation R on Z as xRy if and only if 4|(x+3y). Prove R is an 
equivalence relation. Describe its equivalence classes. 


This is reflexive, because for any x€ Z we have 4|(x+3x), so xRx. 


To prove that R is symmetric, suppose xRy. Then 4|(x+3y), so x+3y= 4a 
for some integer a. Multiplying by 3, we get 3x+9y = 12a, which becomes 
yt+38x=12a-—8y. Then y+3x=4(38a-2y), so 4|(y+3x), hence yRx. Thus we’ve 
shown xRy implies yRx, so R is symmetric. 


To prove transitivity, suppose xRy and yRz. Then 4|(x+3y) and 4|(y+3z), so 
x+8y=4a and y+3z = 4b for some integers a and 6. Adding these two equations 
produces x+4y+3z=4a+4b, or x+8z=4a+4b-4y=4(a+b-y). Consequently 
A\(x+3z), so xRz, and R is transitive. 


As R is reflexive, symmetric and transitive, it is an equivalence relation. 


Now let’s compute its equivalence classes. 

[O]={xeZ:xRO}= {xe Z:4|(x+3-Of={xeZ:4|x}= {...-4,0,4,8,12,16...} 
[Ij={xeZ:xR1}={xeZ:4|(x+3-D}={xeZ:4|(x+3)} =f..-3,1,5,9,13,17...} 
[2]={xeZ:xR2}= {xe Z:4|(x+3-2)}={xeZ:4|(x+6)} ={...-—2,2,6,10,14,18...} 
[3]={xe Z:xR3} = {xe Z:4|(x+3-3)} ={xeZ:4|(x+ 9) ={...-1,3,7,11,15,19...} 


11. Prove or disprove: If R is an equivalence relation on an infinite set A, then R 
has infinitely many equivalence classes. 
This is False. Counterexample: consider the relation of congruence modulo 2. 
It is a relation on the infinite set Z, but it has only two equivalence classes. 


13. Answer: m|A| 15. Answer: 15 
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Section 11.3 Exercises 


1. 


List all the partitions of the set A = {a,b}. Compare your answer to the answer 
to Exercise 5 of Section 11.2. 

There are just two partitions {{a},{b}} and {{a,b}}. These correspond to the two 
equivalence relations R, = {(a,a),(b,b)} and Re = {(a,a),(a,b),(b,a),(b,b)}, respec- 
tively, on A. 


. Describe the partition of Z resulting from the equivalence relation = (mod 4). 


Answer: The partition is {[0],[1],[2],[3]} = 
{{...,-4,0,4,8, 12,...},{...,-3,1,5,9, 18,...}, {...,-2,2,4,6,10,14,...}, {...,-1,8,7,11,15,...}} 


. Answer: Congruence modulo 2, or “same parity.” 


Section 11.4 Exercises 


1. 


Write the addition and multiplication tables for Zo. 


EO}: EO = WOT = iE 
[0] | [0] [1] [0] | [0] [0] 
[1] | [1] [0] [1] | [0] [1] 


3. Write the addition and multiplication tables for Z4. 


+ | [0] [1] [2] [3] - | (0) (1) [2] [3] 
[0] | (0) [1] [2] [3] [0] | [0] [0] [0] [0] 
[1]; 11] [2] [3] [0] [1] | [0] [1] [2] [3] 
[2] | [2] [38] [0] [1] [2] | [0] [2] [0] [2] 
[3] | [3] [0] [1] [2] [3] | [0] [38] [2] [1] 


5. Suppose [a],[b]e Zs and [a]-[b] =[0]. Is it necessarily true that either [a] = [0] 


or [b] =[0]? 


The multiplication table for Z5; is shown in Section 11.4. In the body of that 
table, the only place that [0] occurs is in the first row or the first column. That 
row and column are both headed by [0]. It follows that if [a]-[b] = [0], then 
either [a] or [b] must be [0]. 


. Do the following calculations in Zg, in each case expressing your answer as [a] 


with 0<a<8. 
(a) [8]+[8] =[7] (b) [24] +[11] = [8] (c) [21]-[15] = [0] (d) [8]-[8] = [1] 
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Chapter 12 Exercises 


Section 12.1 Exercises 


1. 


11. 


Suppose A = {0,1,2,3,4}, B = {2,3,4,5} and f = {((0,3), (1,3), (2,4), (3, 2),(4,2)}. State 
the domain and range of f. Find f(2) and f(1). 
Domain is A; Range is {2,3,4}; f(2)=4; f(1)=8. 


. There are four different functions f : {a,b} — {0,1}. List them all. Diagrams will 


suffice. 
f1 = ((a,0),(6,0)} fe = ((a,1),(6,0)}, f3 = {(a,0),(6,1)} fa = (Ca, 1), (0, I} 


. Give an example of a relation from {a,b,c,d} to {d,e} that is not a function. 


One example is {(a,d),(a,e),(b,d),(c,d),(d,d)}. 


. Consider the set f = {((x,y)e Zx Z:3x+y=4}. Is this a function from Z to Z? 


Explain. 
Yes, since 3x+ y=4 if and only if y=4- 3x, this is the function f : Z — Z defined 
as f(x)=4- 3x. 


. Consider the set f = {(x?,x):x¢R}. Is this a function from R to R? Explain. 


No. This is not a function. Observe that f contains the ordered pairs (4,2) and 
(4,-2). Thus the real number 4 occurs as the first coordinate of more than one 
element of /. 

Is the set 0 = {(X,|X|):X © Zs} a function? If so, what is its domain and range? 
Yes, this is a function. The domain is A(Z5). The range is {0,1,2,3,4,5}. 


Section 12.2 Exercises 


1. 


Let A = {1,2,3,4} and B = {a,b,c}. Give an example of a function f:A—B that 
is neither injective nor surjective. 

Consider f = {(1,a),(2,a),(3,a),(4,a)}. 

Then f is not injective because f(1)= f(2). 

Also f is not surjective because it sends no element of A to the element ce B. 


. Consider the cosine function cos :R — R. Decide whether this function is injective 


and whether it is surjective. What if it had been defined as cos:R— [—1,1]? 
The function cos :R — R is not injective because, for example, cos(0) = cos(27). It 
is not surjective because if b =5¢ R (for example), there is no real number for 
which cos(x) = 6. The function cos :R— [-1,1] is surjective. but not injective. 


. A function f :Z— Z is defined as f(n)=2n+1. Verify whether this function is 


injective and whether it is surjective. 

This function is injective. To see this, suppose m,ne Z and f(m) = f(n). 
This means 2m+1=2n+1, from which we get 2m = 2n, and then m=n. 

Thus f is injective. 

This function is not surjective. To see this notice that f(n) is odd for all 
neéZ. So given the (even) number 2 in the codomain Z, there is no n with 
f(n) = 2. 
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7. A function f :Zx Z-— Z is defined as f((m,n)) =2n-—4m. Verify whether this 


11. 


function is injective and whether it is surjective. 

This is not injective because (0,2) 4 (—1,0), yet f((0,2)) = f(-1,0))=4. This is 
not surjective because f((m,n)) = 2n—4m = 2(n—-2m) is always even. If be Z 
is odd, then f((m,n)) #4 b, for all (m,n)€ Zx Z. 


. Prove that the function f :R—{2}— R-— {5} defined by f(x) = batt is bijective. 


Proof. First, let’s check that f is injective. Suppose f(x) = f(y). Then 


Sxt+1l — 5y+1 
x-2 y-2 
(5x+1)(y-2) = (y+1)(«-2) 
5xy—-10x+y-2 = B5yx-l0y+x-2 
-10x+y = -10y+x 
lly = 11x 
Yo cS: 


Since f(x) = f(y) implies x = y, it follows that f is injective. 

Next, let’s check that f is surjective. For this, take an arbitrary element 
b € R—{5}. We want to see if there is an x € R— {2} for which f(x) =), or batt =b. 
Solving this for x, we get: 


5x+1 = b(x-2) 
5x+1 = bx-2b 
5x-xb = -2b-1 
x(5-—b) = -2b-1. 


Since we have assumed b € R- {5}, the term (5-6) is not zero, and we can 
-2b-1 


divide with impunity to get x= Se This is an x for which f(x)= 6, so f is 


surjective. 
Since f is both injective and surjective, it is bijective. a 


Consider the function 6: {0,1} x N— Z defined as @(a,b) =(—1)%b. Is @ injective? 
Is it surjective? Explain. 

First we show that @ is injective. Suppose 0(a,b) = 0(c,d). Then (—1)%b =(-1)°d. 
Since b and d are both in N, they are both positive. Therefore since (—1)%b = 
(-1)¢°d it follows that (-1)* and (—1)° have the same sign. Since each of (—1)% 
and (—1)° equals +1, we have (—1)* = (—1)®, so then (—1)%b = (-1)°d implies 6 = d. 
But also (—1)* =(—1)° means a and b have the same parity, and since a,b € {0,1} 
if that follows a= 6. Thus (a,b) =(c,d), so @ is injective. 

Next note that @ is not surjective because 0(a,b)=(—1)%b is either positive or 
negative, but never zero. Therefore there exist no element (a,b) {0,1} x N for 
which O(a,b)=0€ Z. 
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13. 


15. 


17. 


Consider the function f :R? — R? defined by the formula f(x,y) = (xy,x°). Is f 
injective? Is it surjective? 

Notice that f(1,0) = (0,0) and f(0,0) = (0,0), so f is not injective. To show that f 
is also not surjective, we will show that it’s impossible to find an ordered pair 
(x,y) with f(x,y)=(1,0). If there were such a pair, then f(x, y) = («y,x°) = (1,0), 
which yields xy = 1 and x? = 0. From x° = 0 we get x = 0, so xy = 0, a contradiction. 
This question concerns functions f :{A,B,C,D,E,F,G} — {1,2,3,4,5,6,7}. How 
many such functions are there? How many of these functions are injective? 
How many are surjective? How many are bijective? 

Function f can described as a list (f(A), f(B), f(C), f(D), f(E), fF), f(G)), where 
there are seven choices for each entry. By the multiplication principle, the total 
number of functions f is 7’ = 823543. 

If f is injective, then this list can’t have any repetition, so there are 7! = 5040 
injective functions. Since any injective function sends the seven elements of the 
domain to seven distinct elements of the codomain, all of the injective functions 
are surjective, and vice versa. Thus there are 5040 surjective functions and 
5040 bijective functions. 

This question concerns functions / :{A,B,C,D,E,F,G}— {1,2}. How many such 
functions are there? How many of these functions are injective? How many 
are surjective? How many are bijective? 

Function f can described as a list (f(A), f(B), f(C), f(D), f(E), fF), f(G)), where 
there are two choices for each entry. Therefore the total number of functions 
is 27=128. It is impossible for any function to send all seven elements of 
{A,B,C,D,E,F,G} to seven distinct elements of {1,2}, so none of these 128 
functions is injective, hence none are bijective. 

How many are surjective? Only two of the 128 functions are not surjective, and 
they are the “constant” functions {(A,1),(B,1),(C,1),(D, 1), (E, 1), (F, 1),(G, 1)} and 
{(A, 2), (B, 2),(C, 2), (D, 2), (E, 2),(F,2),(G,2)}. So there are 126 surjective functions. 


Section 12.3 Exercises 


i. 


If 6 integers are chosen at random, at least two will have the same remainder 
when divided by 5. 


Proof. Write Z as follows: Z = Us {5k +j:k¢€Z}. This is a partition of Z into 5 
sets. If six integers are picked at random, by the pigeonhole principle, at least 
two will be in the same set. However, each set corresponds to the remainder 
of a number after being divided by 5 (for example, {5k+1:k€ Z} are all those 
integers that leave a remainder of 1 after being divided by 5). a 


. Given any six positive integers, there are two for which their sum or difference 


is divisible by 9. 


Proof. If for two of the integers n,m we had n =m (mod 9), then n—m =0 (mod 9), 
and we would be done. Thus assume this is not the case. Observe that the 
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only two element subsets of positive integers that sum to 9 are {1,8}, {2,7}, {3, 6}, 
and {4,5}. However, since at least five of the six integers must have distinct 
remainders from 1, 2, ..., 8 it follows from the pigeonhole principle that two 
integers n,m are in the same set. Hence n+ m=0 (mod 9) as desired. a 


. Prove that any set of 7 distinct natural numbers contains a pair of numbers 


whose sum or difference is divisible by 10. 


Proof. Let S = {a1,a2,a3,a@4,a5,a6,a7} be any set of 7 natural numbers. Let’s say 
that a1 <a9q<a3<-:-<a7. Consider the set 


A = {a1—a2,a,—a@3,a1—@4,a1—4a5, 41-46, 41-47, 
a1+Q9,a1,+a3, @1 +04, 01 +a45, a1 +46, a,+a7} 


Thus |A| = 12. Now let B= {0,1,2,3,4,5,6,7,8,9}, so |B] =10. Let f:A—B be the 
function for which f(n) equals the last digit of n. (That is f(97)=7, f(12)=2, 
f (230) = 0, etc.) Then, since |A|>|B|, the pigeonhole principle guarantees that 
f is not injective. Thus A contains elements a;+a; and a;+a; for which 
f(a, +a;)= f(a;+a,;). This means the last digit of a; +a; is the same as the last 
digit of a; +a,;. Thus the last digit of the difference (a; +a;)—(a1+a;)= +aj+a; 
is 0. Hence +a;+a; is a sum or difference of elements of S that is divisible by 
10. a 


Section 12.4 Exercises 


1. 


Suppose A = {5,6,8}, B = {0,1}, C = {1,2,3}. Let f:A—B be the function f = 
{(5, 1),(6,0),(8,1)}, and g:B—C be g=({(0,1),(1,1)}. Find gof. 
gof ={(5, 1), (6, 1), (8, D} 


. Suppose A = {1,2,3}. Let f:A—A be the function f = {(1,2),(2,2),(3,1)}, and let 


g:A—A be the function g = {(1,3),(2,1),(3,2)}. Find gof and fog. 
g°f ={0,0),2,D),63,3)}; fog ={0, D,(, 2), (3, 2)}. 


. Consider the functions f,g:R—R defined as f(x) = /x+1 and g(x)=«?. Find 


the formulas for gof and fog. 
gof(x)=x+1l; fog()= Vx 41 


. Consider the functions f,g:ZxZ-— ZxZ defined as f(m,n) = (mn,m?) and 


g(m,n)=(m+1,m+n). Find the formulas for gof and fog. 
Note gof(m,n)= g(f(m,n)) = g(mn,m?”) =(mn+1,mn+m?). 


Thus | gof (m,n) =(mn+1,mn+m?). 
Note fog(m,n) = f(g(m,n)) = f(m+1,m+n)=(m+1\(m+n),(m+ 1). 
Thus | fog(m,n) =(m?+mn+m+n,m2+2m+1). 


. Consider the functions f :Z x Z— Z defined as f(m,n)=m+n and g:Z—-ZxZ 


defined as g(m)=(m,m). Find the formulas for gof and fog. 
gof(m,n)=(m+n,m+n) 
fog(m)=2m 
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Section 12.5 Exercises 


1. Check that the function f : Z — Z defined by f(n) = 6—-n is bijective. Then 
compute f~!. 
It is injective as follows. Suppose f(m) = f(n). Then 6—-m=6-—n, which reduces 
to m=n. 
It is surjective as follows. If be Z, then f(6—b)=6-—(6—b)=b. 
Inverse: f~!(n)=6-n. 

3. Let B={2”:neZ}={...,4,5,1,2,4,8,...}. Show that the function f : Z — B defined 
as f(n)=2” is bijective. Then find f7!. 
It is injective as follows. Suppose f(m) = f(n), which means 2” = 2”. Taking 
log, of both sides gives log,(2”) = log,(2”), which simplifies to m =n. 
The function f is surjective as follows. Suppose b¢ B. By definition of B this 
means b = 2” for some neé Z. Then f(n)=2” =b. 
Inverse: f~1(n) =logo(n). 

5. The function f:R—R defined as f(x)=2x-e is bijective. Find its inverse. 
Inverse: f~1(x) = one, 

7. Show that the function f :R? — R? defined by the formula f(x, y) = ((x? + Iy,x?) 
is bijective. Then find its inverse. 
First we prove the function is injective. Assume /f(x1,¥1) = f(x2,72). Then 
(x2 + 1)yy = (x2 + Lye and x? = x3. Since the real-valued function f(x) =x? is one- 
to-one, it follows that x; =x». Since x1 =x2, and x +1>0 we may divide both 
sides of (x? + 1)y1 = (x7 + Dye by (x? +1) to get y1 = yo. Hence (x1, 1) = (x2, 2). 
Now we prove the function is surjective. Let (a,b) ¢ R?. Set x =b"* and y= 
ab"? +1). Then f(x,y) =((b7? + lass, (b)?) = (a,b). It now follows that f is 
bijective. 
Finally, we compute the inverse. Write f(x, y)=(u,v). Interchange variables to 
get (x,y) = f(u,v) =((u2 + Lv, u?). Thus x =(u?+1)v and y=u?®. Hence u= y"? and 
v = —3—. Therefore f~ (x,y) = (u,v) = (»¥* 


7] 
v3 41 . ? yl3 4 i 
9. Consider the function f :RxN—N-xR defined as f(x,y) =(y,3xy). Check that 
this is bijective; find its inverse. 


To see that this is injective, suppose f(a,b) = f(c,d). This means (b,3ab) = 
(d,3cd). Since the first coordinates must be equal, we get b=d. As the second 
coordinates are equal, we get 3ab = 3dc, which becomes 3ab = 3bc. Note that, 
from the definition of f, b€ N, so b}#0. Thus we can divide both sides of 
3ab = 3bc by the non-zero quantity 3b to get a=c. Now we have a=c and b=d, 
so (a,b) =(c,d). It follows that f is injective. 

Next we check that f is surjective. Given any (b,c) in the codomain Nx R, notice 
that (35,5) belongs to the domain Rx N, and f(35,6)=(b,c). Thus f is surjective. 
As it is both injective and surjective, it is bijective; thus the inverse exists. 

To find the inverse, recall that we obtained (35,6) =(6,c). Then f-*f (5,0) = 
f-*(6,c), which reduces to (4,b)=f~1(b,c). Replacing 6 and c with x and y, 
respectively, we get f-‘(x,y) =(#,x). 
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Section 12.6 Exercises 


1. Consider the function f :R—R defined as f(x) =x?+3. Find f({-3,5]) and 
f-1((12,19]). Answers: f({—3,5]) = [3,28]; f-1((12, 19]) = [-4, -3] U[3, 4]. 

3. This problem concerns functions f : {1,2,3,4,5,6,7} — {0,1,2,3,4}. How many 
such functions have the property that |f~1({3})|=3? Answer: 4* (3). 

5. Consider a function f:A — B and a subset X <A. We observed in Section 12.6 
that f-'(f(X)) #X in general. However X < f~!(f(X)) is always true. Prove this. 


Proof. Suppose ae X. Thus f(a) « {f(x):x€ X} = f(X), that is f(a) e f(X). Now, 
by definition of preimage, we have f~1(f(X)) = {xe A: f(x) f(X)}. Since ac A 
and f(a)e€ f(X), it follows that a« f~'(f(X)). This proves X < f-1(f(X)). | 


7. Given a function f:A —B and subsets W,X cA, prove f(WnNX)cf(W)n F(X). 


Proof. Suppose b« f(WnX). This means be {f(x):xeWnX}, that is b = f(a) 
for some ae WnX. Since ae W we have b= f(a)€ {f(x):x€ W} = f(W). Since 
ae X we have b= f(a)e {f(x):x€ X} = f(X). Thus b is in both f(W) and f(X), so 
be f(W)nf(X). This completes the proof that f(WnX) > f(W)n F(X). | 


9. Given a function f:A — B and subsets W,X cA, prove f(WUX)= f(W)uU F(X). 


Proof. First we will show f(WUX)¢ f(W)Uf(X). Suppose be f(WUX). This 
means b€ {f(x):xeWUX}, that is, b = f(a) for some ae WUX. Thusae W 
or ace X. If aeW, then b= fla)e {f(x):xe W} = f(W). If ae X, then b= f(a)e 
{f(x):x”€ X}= f(X). Thus 6 is in f(W) or f(X), so be f(W)Uf(X). This completes 
the proof that f(WUX)<¢ f(W)u F(X). 

Next we will show f(W)Uf(X)<c f(WUX). Suppose be f(W)uU f(X). This means 
be f(W) or be f(X). If be f(W), then b = f(a) for some ac W. If be f(X), then 
b= f(a) for some ae X. Either way, b = f(a) for some a that is in W or X. That 
is, b=f(a) for some ae WUX. But this means be f(WUX). This completes the 
proof that f(W)u f(X)<¢ f(WUX). 

The previous two paragraphs show f(WUX) = f(W)Uf(X). a 


11. Given f:A—B and subsets Y,Z CB, prove f-'(Y UZ) = f-'(Y)uf7(Z). 


Proof. First we will show f-'(YuUZ)<¢ f-'(Y)uf-\(Z). Suppose ae f-'(Y UZ). 
By Definition 12.9, this means f(a)—e YUZ. Thus, f(a)e Y or f(a)eZ. If 
f(a)€ Y, then ae f-'(Y), by Definition 12.9. Similarly, if f(a) « Z, then a¢ 
f-\(Z). Hence ace f-'(Y) or ae f-(Z), soae f (Y)Uf-Z). Consequently 
f\(YuZ)cf\vY)uf Z). 

Next we show f-'(Y)uf-'(Z)cf-'(YuUZ). Suppose ae f-'(Y)uf-(Z). This 
means ace f-'(Y) or ae f (Z). Hence, by Definition 12.9, f(a)€Y or fla)€ Z, 
which means f(a)€ YUZ. But by Definition 12.9, f(a)« YUZ means ae f~\(YuZ). 
Consequently f-'(Y)u f-'(Z) ¢ f-W(Y¥ UZ). 

The previous two paragraphs show f~\(YUZ)=f-'(Y)uf7(Z). | 
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13. Let f:A—B be a function, and X <A. Prove or disprove: f(f~1(f(X))) = f(X). 


Proof. First we will show f(f~(f(X))) ¢ f(X). Suppose ye f(f-(f(X))). By 
definition of image, this means y= f(x) for some xe f~'(f(X)). But by definition 
of preimage, x « f~!(f(X)) means f(x) f(X). Thus we have y= f(x) f(X), as 
desired. 


Next we show f(X)¢ f(f 1 (X))). Suppose ye f(X). This means y= f(x) for 
some x¢€ X. Then f(x)= ye f(X), which means x « f~!(f(X)). Then by definition 
of image, f(x) « f(f-(f(X))). Now we have y= f(x)€ f(f-(f(X))), as desired. 


The previous two paragraphs show f (f (F(X) = F(X). a 


Chapter 13 Exercises 


Section 13.1 Exercises 


1. R and (0,00) 
Observe that the function f(x) = e* sends R to (0,00). It is injective because 
f(x) = f(y) implies e* = e’, and taking In of both sides gives x= y. It is surjective 
because if b € (0,00), then f(n(b)) = 6. Therefore, because of the bijection 
f :R— (0,00), it follows that |R| = |(0,00)|. 


3. R and (0,1) 
Observe that the function if (x) = cot" (x) sends R to (0,1). It is injective and 
surjective by elementary trigonometry. Therefore, because of the bijection 
f :R— (0,1), it follows that |R| = |(0,1)|. 

5. A={8k:keEZ} and B={7Tk:keZ} 
Observe that the function f(x) = tx sends A to B. It is injective because 
f(x) = f(y) implies {x = 7y, and multiplying both sides by 3 gives x= y. It is 
surjective because if b¢« B, then b= 7k for some integer k. Then 3ke A, and 
f(8k)=7k=b. Therefore, because of the bijection f:A— B, it follows that 
|A| = |B. 

7. Zand S={...,3,4,5,1,2,4,8, 16,...} 
Observe that the function f : Z— S defined as f(n) = 2” is bijective: It is injective 
because f(m) = f(n) implies 2” = 2”, and taking log, of both sides produces m =n. 
It is surjective because any element b of S has form b = 2” for some integer n, 
and therefore f(n) = 2” = 6. Because of the bijection f :Z—S, it follows that 
|Z| =|S]. 

9. {(0,1}xNandN 
Consider the function f : {0,1}x N—N defined as f(a,n) =2n-—a. This is injective 
because if f(a,n) = f(b,m), then 2n-a =2m-—b. Now if a were unequal to b, one 
of a or b would be O and the other would be 1, and one side of 2n-a=2m-—b 
would be odd and the other even, a contradiction. Therefore a=b. Then 
2n—-a=2m-—b becomes 2n—-a=2m-—a; add a to both sides and divide by 2 to 
get m=n. Thus we have a=b and m=n, so (a,n) =(b,m), so f is injective. 


11. 


13. 


15. 
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To see that f is surjective, take any beN. If b is even, then b =2n for some 
integer n, and f(0,n)=2n-—0=b. If 6 is odd, then 6=2n+1 for some integer n. 
Then f(1,n+1)=2(n+1)-—1=2n+1=b. Therefore f is surjective. Then f isa 
bijection, so |{0,1} x N| =|Ni. 


[0,1] and (0,1) 

Proof. Consider the subset X = {i :n€N}c[0,1]. Let f :[0,1]— [0,1) be defined 
as f(x)=«x if xe [0,1]—X and f(4)=—, for any +€X. It is easy to check that 
f is a bijection. Next let Y = {1- + :n€N}c[0,1), and define g:[0,1) — (0,1) as 
g(x) =x if x€[0,1)-Y and g(1-+)=1-—, for any 1-4 Y. As in the case of /, it 
is easy to check that g is a bijection. Therefore the composition gof :[0,1]— (0,1) 
is a bijection. (See Theorem 12.2.) We conclude that |[0,1]| = |(0,1)I. | 


YA(N) and A(Z) 

Outline: By Exercise 18 of Section 12.2, we have a bijection f : N— Z defined as 
f(ny= oe. Now define a function ®@: A(N) > A(Z) as O(X) = {f (x): 
x€X}. Check that © is a bijection. 


Find a formula for the bijection f in Example 13.2. 
Hint: Consider the function f from Exercise 18 of Section 12.2. 


Section 13.2 Exercises 


1. 


Prove that the set A = {In(n):n € N}CR is countably infinite. 
Just note that its elements can be written in infinite list form as In(1),1n(2),1n(3),---. 
Thus A is countably infinite. 


. Prove that the set A = {(5n,—3n):n€ Z} is countably infinite. 


Consider the function f : Z— A defined as f(n) = (5n,-3n). This is clearly 
surjective, and it is injective because f(n) = f(m) gives (5n,—3n) =(5m,—3m), so 
5n=5m, hence m=n. Thus, because f is surjective, |Z|=|A|, and |A| =|Z|=No. 
Therefore A is countably infinite. 


. Prove or disprove: There exists a countably infinite subset of the set of irrational 


numbers. 
This is true. Just consider the set consisting of the irrational numbers 


amit uw 
12937497 


. Prove or disprove: The set Q!°° is countably infinite. 


This is true. Note Q!° = Qx Q@x---x Q@ (100 times), and since Q is countably 
infinite, it follows from the corollary of Theorem 13.5 that this product is 
countably infinite. 


. Prove or disprove: The set {0,1} x N is countably infinite. 


This is true. Note that {0,1} x N can be written in infinite list form as 
(0, 1),(1, 1), (0, 2), (1, 2), (0, 3), (1,3), (0,4), (1, 4),---. Thus the set is countably infinite. 
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11. Partition N into 8 countably infinite sets. 


For each i € {1,2,3,4,5,6,7,8}, let X; be those natural numbers that are congruent 
to i modulo 8, that is, 


X, = {1,9,17,25,33,...} 

Xq = {2,10,18,26,34,...} 
X3 = {8,11,19,27,35,...} 
X4 = {4,12,20,28,36,...} 
X5 = {5,13,21,29,37,...} 
Xe = {6,14,22,30,38,...} 
X7 = {7,15,13,31,39,...} 
Xg = {8,16,24,32,40,...} 


13. If A={X CN:X is finite }. Then |A| =No. 


Proof. This is true. To show this we will describe how to arrange the items of 
A in an infinite list X1,X2,X3,Xa,.... 


For each natural number n, let p, be the nth prime number, that is p; = 
2,p2 = 3,p3 = 5,p4 =7, and so on. Now consider any element X € A, so X = 
{n1,N9,N3,...,.np~}, Where k =|X| and n; €N for each 1<i<k. Define a function 
f:A—N as follows: f(X)= pn, Pny***Pn,-. That is, we treat X ¢ A as an “index" 
for the prime sequence and just map the entire set to the product of all the 
primes with corresponding index. For example, take the set X = {1,2,3}. Then 
f(X) = fC, 2,3}) = pipep3 =2-3-5=30. 

Note that f is an injection from A to N. Assume f(X) = f(Y). Then, by definition 
of the function, f(X) = pn, Pny°*:Pn,- Similarly, f(Y) = pm,Pms°*:Pm,. By the 
fundamental theorem of arithmetic, these are the prime decompositions of each 
f(X) and f(Y). Furthermore, the fundamental theorem guarantees that these 
decompositions are unique. Hence {n1,ng,...,nz}={m1,mo,...,ms} or X=Y. 


This means each finite set X CN is associated with a unique natural number 
f(X). Thus we can list the elements in X in A in increasing order of the 
numbers f(X). The first several terms of this list would be 


{1}, {2}, {3}, {1,2}, {4}, {1,3}, {5}, 16}, {1,4}, {2,3}, {7}... 


It follows that A is countably infinite. a 


Section 13.3 Exercises 


1. Suppose B is an uncountable set and A is a set. Given that there is a surjective 
function f:A— B, what can be said about the cardinality of A? 
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The set A must be uncountable, as follows. For each b« B, let a, be an 
element of A for which f(a,)=6. (Such an element must exist because f is 
surjective.) Now form the set U = {a,:b¢B}. Then the function f:U—B is 
bijective, by construction. Then since B is uncountable, so is U. Therefore U is 
an uncountable subset of A, so A is uncountable by Theorem 13.9. 


. Prove or disprove: If A is uncountable, then |A| = |R|. 


This is false. Let A= A(R). Then A is uncountable, and by Theorem 13.7, 
IR| < | A(R)| = |Al. 
. Prove or disprove: The set {0,1} x R is uncountable. 


This is true. To see why, first note that the function f :R— {0}x R defined as 
f(x)=(0,x) is a bijection. Thus |R| =|{0} x R|, and since R is uncountable, so is 
{0}x R. Then {0}x R is an uncountable subset of the set {0,1} xR, so {0,1}xR is 
uncountable by Theorem 13.9. 


. Prove or disprove: If ACB and A is countably infinite and B is uncountable, 
then B-A is uncountable. 


This is true. To see why, suppose to the contrary that B—A is countably infinite. 
Then B = AvU(B-—A) is a union of countably infinite sets, and thus countable, 
by Theorem 13.6. This contradicts the fact that B is uncountable. 
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